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"There Is no use trying," she said: "One can't 
believe 1«|)oss1bTe things." "I daresay you 
haven't had nwch practice," said the Queen, 

"When I was your age, I alw^s did It fmr half* 
an-hour a day. Why, sometimes I've believed as 
many ^ six Impossible thinsp before breakf^t." 

—Lewis Carroll 

Preface 

For at least a decade It has been recognized that any truly effective 
search of the radio frequency band for extraterrestrial signals would require 
a signal detector system of unprecedented resolution and sensitivity. Unfcnr* 
tunately, all searches to date have relied upon rather conventional detection 
equipment, and thus have not been able to probe deeply Into our stellar 
neighborhood. 

A great Impetus for the search for extraterrestrial intelligence (SETI) 
effort cane In 1973 in the form of a NASA- Ames Sianmer Study (tesign of a large 
scale search system. This system, called Project Cyclops, gains sensitivity 
through linking a great nianber of telescopes of mocterate size. The study 
demonstrated the feasibility of a full-scale SETI program to begin In the 
near future. 

In addition to Project Cyclops, the rapid advance of digital electronics 
In the 70's also led to Increased Interest In SETI. For the first time, 
trough the development of Innovative FFT algorithms and fast microprocessors, 
an 8-m1 111 on-channel spectrum analyzer (NCSA) was designed to meet the needs 
of a SETI program. The NCSA puts out a very large database at very high r-'tes. 
The development of the device which follows the NCSA, called the Signal 
Detector, represents a major design challenge In both data processing and 
algorithm optimization. The design of this device Is the subject of this 
report. 


The stu(^ was sponsored by the American Society for Engineering 
Education (ASEE) and the National Aeronautics and Space Administration (NASA). 
The host institutions were the University of Santa Clara and the NASA-Ames 
Research Center. The 1979 Summer Study brought together 22 professional 
educators In the fields of physics, statistics, electrical engineering, and 
radio astronoRiy. During this summer they designed an Integrated system of 
a new and original type which can be built with today's technology. 

As the study commenced the group was presented with two very difficult 
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pn^lens. They were; "How does one process l/2-terab1ts (10 bits) of 
data In 1000 seconds," and "How does one detect a conpletely unspecified 
signal with acceptable sensitivity?" 

Fortunately, the study team rose to the diallenge, and an Intense period 
of learning, groping, and Imagining followed. Proposed solutions emerged, 
each with Its small camp of supporters. Adding fuel to ensuing debates 
was the fact that for an algorithm to be acceptd>le it was required to be 
both sereltive and feasible (Implementable). This requirement brought 
together hardware specialists and mathematical theorists. Algorithm makers 
conferred with data handlers. Outside specialists were frequently brought 
In to determine if a product could truly be produced. All candidate algo- 
rithms, such as Pattern Recognition, Analysis of Variance, and Coherence 
Measure were subjected to a makeshift scientific forum, where the participants 
learned to speak the language of signal-to-nolse ratios and bit rates. 
Concurrent with this debate was the search for techno1(^1es commensurate to 
the massive quantities of processing required. New and exotic technologies 
(IncliKlIng optical laser storage, suf^r computers, rmjltl processing computer 
arrays) were explored. Again concurrent with these studies was one by the 



psychologists of the group regarding the htenan brain as the most sophisti- 
cated "algorithm processor" of all. It was a period of Intense inter- 
disciplinary learning with statisticians developing Integrated circuits 
and physicists studying human response to stimuli, and with each person 
periodically staring Into noise-dominated display screens attempting to 
discern a hidden signal. 

This period of open scientific query was ultimately channelled Into a 
period of determined engineering design and layout, where all chosen methods 
were Incorporated and Integrated. It Is somewhat sad that so many promising 
and exciting areas of research had to be eventually constrained to definitive 
dry statements and diagrams within this report, but It was felt that one 
well -sped fled system was needed which served to Incorporate this broad 
scope of Ideas into one. 

We wish to say a word about the use of con^uters by the study 
participants. Individuals within the group produced simulations which: 

- determined optimal ways of integrating drifting carrier 
waves* 

- tested Intelligent algoritNns to follow drifting carrier 
waves 

- detemlned the sensitivity of cc^erence measure and shift 
SNRs 

- showed the effect of quantization on variance algorithms 

- developed 3-D display techniques for human pattern 
recognition 

- demonstrated the ability of himans to recognize patterns 

- demonstrated automated pattern and cluster recognition 


* 


explained In the body of this report 


Such sitrajiatlons with ccmputers and with humans aided by computers 
helped greatly In gaining an evaluation of beliefs. Simulation of the 
various components of the system Is an economical and effective way of 
evaluating and expanding the Ideas proposed in this report. 

This study proposes a system that acts as a detector of intelligent 
signals. It combines the thinking of physicists, statisticians, 
electrical engineers and radio astronomers Into one Integrated design 
constructable with today's technology. While we do not think that this 
study Is the final answer In temms of Instrumentation and algorithms In 
tile search for extraterrestrial life, we do feel that many of these 
proposals are rich with potential and should be exploited further. 

A word about the choice of the name for the study— Project Oasis. One 
of the major candidate regions of the electromagnetic spectrum In which to 
search 1s the range from 1400 MHz to 1700 Miz. Near the low end of this 
band is the spectral line of hydrogen. Near the upper end Is the line for 
hydroxyl radical. These two lines of the dissociation products of water 
gave rise to the suggestion that this narrow band of frequencies, where 
nature passes signals with relative ease, be called the “water hole," and 
Indeed what better place might we search than that ancient meeting ground 
of so many disparate species of living creatures. It was this allusion to 
a water hole which suggested the word Oasis, a patch of green In a vast 
expanse of arid desert, with Its water hole providing the source of life. 

We have each come from this sunmer project realizing the power of 
collaboration and cooperation between differing disciplines and viewpoints. 

This study Is the product of a group of Individuals who worked together and 
reached beyond their limits to break new ground, just as man imist search beyond 
himself towards the limits of space If he Is to reach the stars. 
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GLOSSARY 


A scan refers to one spectrum>-8 Niz x 1 Hz, with 16 bits per channel 
produced In om second. 

The entire array refers to 1000 seccmds of the MCSA output and Is a 1000 
sec X G Hhz matrix of raw data, (10(K) scars). 

A block Is a 20 sec x 40 Hz secticm of the entire array, BOO points. 

The compacted array is a 50 x 250,0(K} matrix of 9-b1t numbers. 

A pattern blyk Is a 50 x 200 matrix of 9-b1t numbers whidi Is a vertical 
strip of the compacted array. 

A pattern class is a set of 9-bit numbers with specified bits ^ual to 1 
whidi determine the class and no constraint on the urspecifled bits. 
Thus, both (0, 1, 1, 1,0, 1,0, 1, 1) and (1, 1. 0.1, 1, 1, 0. 1, 0) 
belong to the class (♦, 1. *, 1, *. *. *, 1) . And also to the class 
(*. 1. *, *. *. *, *). while (0. 0, 1. 1, 0, 1. 0, 1. 1) belongs 

to neither of these classes. 

A pattern point is any 9-bit nunber of the contacted array. 

A cell is the t1me-freqi«ncy location of a pattern point. Note the one-to- 
one correspondence between blocks and cells. 

In addition, the following mnemonics are used wictely throughout the report: 

AC - Accumulator 

ANOVA - Analysis of Variance 

CD - Cluster Detector 

CFSR - Coirr.urt Frame of Semantic Reference 

CWD - Carrier Wave Detector 

ETI - The Extraterrestrial Intelligence 

6CV - Generalized Coherence Value 

HCSA - Multi-Channel Spectrum Analyzer 

l€IT - Numerical Battery of Independent Tests 

OSD - Oasis Signal Detector 

PD - Pulse Detector 

PROM - Programmable Read-Only Memory 

RFI - Radio Frequency Interference 

RAM - Random Access Memory 


ROM - Read-Only Memory 

SET I - Search for Extraterrestrial Intel ligmce 
SNR - Signal to Noise Ratio 

Obit - Gigabit. 10^ bits 
Mbit - Neg^lt. 10^ bits 
Byte - 8 bits 



Chapter 1 
Introduction 


The Search for Extraterrestrial Intelligence (SET I) 



You iMiy seek It with thimbles— and seek It with care; 
You may hunt It with forks and h<^; 

You may threaten Its life with a railway share; 

You may charm It with smiles and soap— 

We have sailed many months, we have sailed many weeks, 
(Four weeks to the month you may mark). 

But never as yet ('tis your captdin who speaks) 

Have we had the least glimpse of a Snark! 

—Lewis Carroll, The Hunting of the Snark 

Because of continuing advances and discoveries made by astronomers In the 
"outer" universe and by microbiologists In the "Inner" universe, our estimates 
of the probability of life being widespread In the universe are growing rapidly. 

At the same time, technological advances are Increasing the probability of our 
being able to succ^sfully communicate over the large Interstellar distances 
that might exist between adjacent Intelligent civilizations. That this Idea 
has clearly energed from the realm of science fiction Into that of science 
fact may be seen from a statement made by the National Academy of Sciences In 
their Recommendations for Astronomy In the 1970 's. 

"Our civilization is within reach of one of the greatest steps In Its 
evolution: knowledge of the existence, nature and activities of Independent 
civilizations In space. At this Instant, through this very document, are 
perhaps passing radio waves bearing the conversations of distant creatures— 
conversations that we could record If we but pointed a telescope In the right 
direction and tuned to the proper frequency. 

"More and more scientists feel Wrat contact with other civilizations Is no 
longer something beyond our dreams but a natural event In the history of mankind 
that will perhaps occur In the lifetime of rnary of us. The promise Is now too 
great, either to turn away from It or to wait much longer before devoting 
major resources to a search for other Intelligent beings. 
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"In the long run this mey be one of science's most Is^rtent and 
Rest prof(Kind contributions to mankind and to mir civilization." 

1.1 “ The Evidence f<MP Life 

Radio Astronomers have found Increasingly large organic molecules 
Mithin gas and dust clouds distributed throughout (Mr gala}^. Nolecuitt 
such as formaldehyde, ethyl alcohol, formmnlite, and outers with molecular 
weights up to 123 have been found. 

We have artificially created the building blocks of 11feo*am1no acids, 
nucleic acids and primitive proteins— In our laboratories by supplying 
energy to simple mixtures of water, mrmonla, and methane. The energy has 
been successfully supplied by a great variety of sources, such as electric 
spark, ultraviolet radiation, heat, acoustic waves, x-rays, and nuclear 
particle bombardment (Ponnamperuna, 1972). Since these single substances 
and energy sources are likely to be present on any newly formed planet, the 
precursors of life probWbly are widespread, and although we have no direct 
evidence that further development or evolutirwi will necessarily occur. It Is 
certainly a possibility. These same building blocks have also been found 
In moon dust and In meteorites that have landed on the Earth. This Indicates 
that Earth has no monopoly on the organic con^cxinds necessary for life as 
we know It. 

In all our studies of nature, we have yet to encounter a truly unique 
object or event. Nature tends to create things of marry kinds, but each kind 
Is well -populated. We find the same elements In our Sun and In all the other 
stars and In all the distant galaxies as we do right here on Earth. We 
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observe that the same laws of physics hold everyvd^re In the universe. 

The Sun and the Earth are but average m«it>er$ of our gala)^. There Is no 
basis to believe the Earth Is In any sense unique. 

1.2 - Where Is Extraterrestrial Life to Be Found? 

Our space probes to date have Indicated little hope for life on the 
Hoon, Iters, or Venus, and conditions on the remaining planets of our solar 
system seem hostile to any life. Moving mit to the stars In our galaiQf, 
we can expect to find life-bearing planets associated with only certain 
types of parent stars. 

The parent star should not be t(X) large, since such stars bum them- 
selves out In too short a time for intelligent life to evolve. Nor should 
It be too small, since the planet then has to be so close to get enough heat 
that tidal effects tend to lock the planet with one side always facing the 
star, similar to Mercury and the Moon. This causes all the atmosphere 
to freeze out onto the cold side. The parent star should be a slow rotator. 
Indicating that It may have lost im^t of its Initial angular momentum to a 
planetary system. In our solar system, 98X of the total angular momentian 
Is carried by the planets, and the Sun Is a very typical slow rotator. 

The heavy elements necessary for life are formed only in the cores of 
stars. They are not present In the very old first generation stars, but 
rather are present in the younger second and subsequent generation stars that 
have formed out of the debris of their exploded for^fateers. Many stars are 
part of small groups of two, three or more stars rotating about a conmon 
center of gravity. Stable planetary orbits are less likely to occur In such 
imjltiple star systems. 




All of thf stfllar characteristics described above can be measured 
from the Earth » enabling us to choose which specific stars are most likely 
to harbor life. The factor which takes very unlikely Individual occurrences, 
and makes them likely In an overall sense. Is tlw vast size of our universe. 
Thus, If we eliminate from further cor^lderatlcm all stars not meeting all 
of the above criteria, we are still left with around 20 million good 
candidates In our own galaxy. 

1.3 - How Should Me Look for Extraterrestrial life? 

Considering the time and energy Involved, the possibility of actual 
Interstellar travel seems very remote. That leaves us with attempting to 
converse from afar, probably by some form of electromagnetic radiation. We 
will have to Ignore for the present the fraction of civilizations that 
might be sending out gravity waves or other signals as yet unknown to us. 

The two obvious candidates for electromagnetic signalling are light 
and radio waves. The advent of the laser adds considerably to the possibility 
of light signalling, but one basic difficulty 1$ that of trying to outshine 
the parent star. Stars radiate most strongly In just those spectral regions 
where light signalling Is apt to occur. Another difficulty Is that the 
diffuse material between the stars tends to absorb light, making such 
signals be rather short-range. 

Radio waves, on the other hand, are not bothered appreciably by stellar 
Interference, and are regarded as being the most likely signals to look for, 
by most workers In the field. Our own present capabilities are quite formi- 
dable. If we took our most sensitive receiver, our most powerful transmitter 
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and our biggest antenna and combined them In an Interstellar communications 
link, we could conmunlcate with a twin of that system at a distance of 
100,000 light years. That fact In Itself goes a long way toward making 
the whole Idea of Interstellar communications seem possible. 

Our cIvIlIzatlOT Is an “emerging” one from the standpoint of Inter- 
stellar comminl cations* We have not yet matured to the point where we 
consider It practicable to send out signals with no hope of reply for 
hundreds of years. On the other hand, we could receive whatever signals 
already exist with imich less effort, and with the possibility of Immediate 
success. The value of such one-way communications should not be discounted. 
(Xir own radio and television broadcasts, as well as books, are one-way 
communications In the sense that we cannot immediately reply to the 
sender. Our knowledge of early Earth civilization through archaeological 
exploration comes through a communication system that is not only one-way, 
but Is also irreversible and highly unintentional. Yet It is of inestlm^le 
value to us. 

There are two types of radio signals that we might search for. There 
are those Intended for the Internal use of the sender, analogous to our cwn 
radio, TV, radar, etc., transmissions whibh Inevitably leak out Into space. 
Then there are those Intended specifically to attract attention or serve 
as a space beacon of some sort. Of the two, leakage signals are expected 
to be imjch weaker, since they represent wasted poii^r from the sender's 
viewpoint. Leakage signals are also more difficult to search for, since 
we cannot make reasonable assumptions as to expected frequencies, modulation 
methods, bandwidths, etc. We have only to consider the chaotic appearance 
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Earth leakage signals would have to an outsider to appreciate the 
magnitude of this problem (Sullivan, 1978 )« Beacon signals, on the 
other hand, can be expected to be optimized In the sense of making It 
easy for us to find them, since that Is tteir purpose. To be sure, we 
should not forget about leakage signals, but their discovery Is more 
likely to be accidental. 

1.4 - Anti -Cryptography vs. Anthropocentrism 

Since It Is not possible to thoroughly search the entire dimensions 
of frequency, time, space and signalling methods, It Is necessary to 
construct a search strategy that will reduce or eliminate as many of 
these dimensions as possible. This Involves making a number of 
assumptions about the characteristics of the signals we are looking for. 

Our search strategy Is based, in so far as pc^slble, on physical 
laws or properties which are true throughout the universe. The radio 
frequency spectrum can thus be limited on the lower end by galactic noise 
at about 1 GHz, and on the high end by quantum noise at about 100 GHz. 
Dispersion In the Interstellar medium limits useable bandwidth to a few KHz. 
These limitations should be recognized by any civilizations considering 
Interstellar communications. Further, communication theory suggests very 
slow signalling rates and narrow bandwidths for maximum range. 

The energy-conscious designer of an Interstellar beacon will choose 
all Its operating characteristics In such a way as to maximize Its 
probd)111ty of detection. This means that If a search dimension can be 
minimized or eliminated by a design choice. It will be. An exmnple of this 
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Is the dimension of time. If a beacon transmits In different directions 
at different times, we would be lucHy to be looking toward It at just 
the time It was transmitting toward us. On the other hand. If a beacon 
transmits continuously and omni 'directionally, we need not concern our- 
selves with the unknown of when to look at It, and the dimension of time 
Is effectively eliminated. 

Thus, one approach In tte design of a signal detector Is to develop 
an Idea of what sort of signal would constitute an optimal Interstellar 
beacon. In terms of distingulshablllty from natural sources, as well 
as providing ease of reception. On such grounds, one conclusion (Dixon, 
1973) Is that the signal Is In the 1-3 GHz range, circularly polarized (with 
binary left and right hand modulation), narrow band, and possibly Doppler 
corrected. For the reasons above, the Oasis signal detector, as seen In 
the next two chapters. Is especially sensitive to narrowband polarized 
signals. 

In the event that such a signal Is not the optimal beacon, or If the 
transmitted signal Is nonoptimal, the OSD Is designed to be sensitive to a 
broad class of signals. Including those transmitted routinely and acciden- 
tally from Earth Into space. 

1.5 - Why Should He look? 

Consider the continuum of possible development levels of civilizations, 
and their corresponding communications capabilities. Our civilization has 
only very recently achieved the technology necessary for Interstellar 
conimjnlcatlons. We cannot communicate with less developed civilizations. 
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because they cannot construct their half of the circuit. We can conmunlcate 
(with difficulty) to otiier civilizations of equal development to our own, 
and can do so with Increasing ease to civilizations more and more advanced 
than ours. If we assume that we are midway along tt« Infinite time line 
of development, then we cannot reach all those others In the less developed 
half. The chances of finding another civilization of equal development 
to our own Is small, since that Is analogous to finding a single (Mint 
on an Infinitely long line. This leaves only the more advanced civlllzatiws 
as being those most likely to be found. 

We know that our own civilization Is capable of destroying Itself 
thr(Migh wars , overpopulation, pollution and other causes. It Is possible 
that all_ civilizations suffer such a fate at some critical point In their 
development. Perhaps that Is a general law of nature; we simply do not 
know. If this were the case, we might never find another civl llzaticm. 

Cn the other hand. If we were to find another civilization (that would very 
llkily be more advanced than we), this would be evidence that It Is at least 
possible for a civilization to attain maturity without destroying Itself. 

This might provide the Impetus we need to redoiA>le our efforts to eliminate 
such possibilities. Perhaps the signals themselves would tell us how to 
solve these problems. This might be referred to as a cosmic feedback 
effect, where the transfer of knowledge from one civilization to another 
tends to stabilize, preserve and synchronize them. 

Our sense of perspective would change If we realized we were not alone 
In the universe. Perfiaps some of our local problmns would not seem quite 
so Important. Looking at things from a larger viewpoint might reduce the 
foolish. Jealous and diauvinistic tendencies we have. 
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Mankind naads a cont1nu1n9 challanga* Without 1t» cultural 
stagnation will sat In and our civilization will <tec11ne. Thara imist 
always ba naw frontiers and adventures to face or wa will sit back on 
our laurels and let the universe go by. 

1.6 - How Can We Understand Their Messages? 

Whatever they say* It will be said In such a way as to make It as 
easy as possible for us to understand It. Linguists s^y that a Common 
Fr « of Semantic Reference (CFSR) must exist before meaningful communi- 
cations can occur In any context. In other words* mutually conmon know- 
e(^r or expediences must serve as the basis for Initial understanding. 

Once such a basis Is established* knowledge which Is known only to one 
can be Imparted to the other. We have successfully translated ancient 
lost languages, but a large CFSR exists between any two races of humans. 
Dolphins are apparently quite Intelligent* and have complex speech* yet 
we have been unable to communicate with them. The CFSR Is less obvious 
In ^at case. 

An obvious CFSR that we imist share with any Intelligent technological 
civilization Is that of counting and numbers. This has been exploited by 
Hans Freixlenthal (I960)* a Dutch mathematician* who created Lingua Cosmica. 
The basic Idea i to send a repetitive message which begins with a 
counting sequence and builds up logically by telling how to answer a 
transmission or telling where other transmissions might be found. The first 
message might be referred to as an entrance examination. 




Each subsequent message or lesson Introduces exactly one new 1(tea or 
symbol, using It In many different contexts with previously defined 
symbols, to make Its meaning obvious. For example, the conc^t of 
"greater than" can be Introduced by using a new symbol ”?" In context with 
the numbers learned in the entrance examination, I.e., 

2?1 3?1 3?2 4?1 4?2 4?3 (etc.) 

Other schemes have also been created (see Arbib, 1979). 

The Important thing to note here Is not that we really expect Inter- 
stellar signals to be exactly like Lingua Cosmica, but that at least 
metho(te seem to exist for Information to be sent, received, and under- 
stood among different civilizations. 

1 .7 - What Is Being Done Now? 

To date, we have only scratched the surface of what we are capable of 
doing, in terms of actually looking. Contrary to popular belief, radio 
observatories do not spend all their time searching for Intelligent signals. 
It would be closer to the truth to say that radio observatories never 
search for Intelligent signals. The actual situation is that most obser- 
vatories never search at all, a few search for a tiny fraction of their time, 
and only one searches all the t1n«. A long term search for pulsed signals 
Is being conducted In the USSR, using very small, non-directlonal antennas. 

A full-time search for narrow-band signals using a large radio astronomy 
antenna has been In operatic since 1973 at Ohio State University. The 
various searches to data are summarized In Table 1.1 (excerpted for data 
provided by Jill Tarter of NASA-Ames Research Center). 
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Table 1.1 - Past and Present Search Projects 
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All of these searches basically employ previously-existing radio- 
astronomical equipment and techniques. No pc^ltlvely Identified signals 
from extraterrestrial civilizations have been found, although several 
single-event unexplained cases have been noted (Kraus, 1979). Numerous 
papers and books have been published the subject, and many national 
and International conferences have been held. A scientific journal 
devoted exclusively to the subject (called Cosmic Search, Kraus et al., 
1979) Is noM being published. 

To give proper perspective to what Is being done now In comparison to 
what could be done, consider the seared for earthquakes In the state of 
California. An easy earthquake search experiment that can be performed 
without specialized equipment Is to simply stand outside somewhere In 
California for 10 minutes and note carefully If you feel any vibrations. 

The most likely outcome of this experiment Is not Indicative of the true 
situation, singly because Its scope Is too small. 

In 1971, NASA sponsored Project Cyclops (Oliver, 1971), a feasibility 
study of what would be required in terms of time, money, and personnel to 
mount a full-scale search for Intelligent extraterrestrial radio signals. 
The basic recommendations were to construct several 100 meter d1 amieter dish 
antennas, and search the nearby stars In great detail. If no signals were 
found, additional dishes could be constructed In stages, up to a limit of 
about 1000, to search progressively more distant stars. If a signal were 
found at an early stage, the entire array would not need to be constructed. 

In 1975-76, NASA held a series of Science Workshops on Interstellar 
Communications (Horrison, et al., 1977). A wide variety of well-known 
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scientists In variotis fields were convened to re-examine the SET! cono^t 
In light of current knowledge and tedmology. The ccxfisensus of all those 
attending was that It was both timely ai^ feasible to be^ln a serious search 
for extraterrestrial Intelligence and that It can be done now at only modest 
cost. 

1.8 - Detection Ted^nlques Used In Previous SETI Projects 

All current and previous SETI projects use equipment that Is either 
simple and Inexpensive* or that has been adapted from some other original 
purpose. The techniques may be broadly grouped Into two categories-* 
those which search for wideband pulsed signals and those which search for 
narrowband continuous signals. Refer to Table 1.1 for more details (xn 
these various searches. 

W1(teband Pulsed Signal Techniques 

Two projects In the USSR employ widely separated (thousands of miles) 
receivers, with simple non-dl recti cnal antennas, to make tape recordings 
of pulsed signals. The tapes are later sent to a central location and 
cross-correlated to determine If any time- coincident pulses were observed 
among the various receivers. Typically each receiver observes mar\y 
random pulses due to RFI, lightning, and other phenomena local to each 
receiver. Ar\y such pulses would not be observed at the other receivers 
however, so they are eliminated by the cross-correlation process. Pulses 
which are received simultaneously at all or most of the receivers me^y be 
Inferred to have a cannon, most likely extraterrestrial, origin. 

A second tedinique for finding pulses Is the use of pulsar seardt 
equipment, being done In 6erm«\y. Pulsars are natural astronomical objects 
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that emit periodic broadband pulses, which are received as pulses that 
sweep downward In frequency with time. The apparent frequency sweep Is 
caused by the velocity of wave propagation In the Interstellar medlun 
being a function of frequency (1,e. , high frequency signals travel faster 
than lew frequency signals). Pulsar detection equIfHnent typically uses 
several receivers spaced In frequency, to observe both tiie pulsing and 
sweeping phenomena. 

Narrowband Continuous Signal Techniques 

The great majority of projects search for narreadiand signals, from 
specific target objects. Their techniques differ from one another 
principally In the method and degree of achieving narrow bandwidth, and 
in the nimher of receiver channels In simultaneous operation. In 
virtually all cases, the equipment used Is adapted from radloastronomical 
natural line detection apparatus. 

Banks of analog filters are used to achieve moderate bandwidth 
(10 kHz) and a moderate number of channels (50). To achieve a larger 
nwiher of channels (1000) autocorrelation receivers are ised. These 
receivers digitally confute the Fourier Transform of the autocorrelation 
function of the Incoming signal In real tire, giving as output the power 
spectrum of the signal. For extremely narrow bandwidths (10 Hz) and high 
number of channels, the Incoming signal Is recorded on a high-speed tape 
recorder for later (non- realtime) processing. The tapes have been 
analyzed by a variety of techniques, Inclullng direct Fourier Transformation 
In a large mainframe coim)uter, optical Fourier Transformation using a laser 
and lens system, and Fast Fourier Transformation using a mini -computer. In 
all cases, the power spectrum Is computed and Irmpected for peaks. 
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The project In Ohio employs a number of specialized SETI signal 
detection algorithms. The sl^ Is cttitinuously swept* at the Earth's 
rotation rate. In that mode of observation, the Intensity of signals 
which come from small-angular-extent objects appears to trace out the 
antenna pattern, as a function of time. This fact Is used by continuously 
confuting the normalized cross-correlation function of ti»e received data 
with the antenna pattern. In each channel. This procedure discriminates 
against large diameter natural astronomical objects and short-time peaks. 

In addition, adjacent channels are Intercompared and If a detection occurs 
simultaneously on two or more channels. It Is Ignored. This discriminates 
against w1 de-band signals. When a signal Is detected In one of the 
channels, a narrowband tuneable filter Is autwnatlcally activated and 
swept slowly across the frequency range of the detection channel, to 
provide a re-examination of the signal at 1/10 the discovery bandwidth. 

RFI Is largely eliminated through use of a beam-switching technique, where 
the receiver output Is the difference between two closely-spaced beams on 
the sky. Any r^al source appears in only one of the beams at a time, and 
hence Is observed normally, RFI In general Is received In the sidelobes 
of the antenna, which are the smt« for both main beams. Thus RFI is 
effectively subtracted out. 

Several other programs make use of data and observations done for 
other radloastronomical purposes, which are then reanalyzed with SETI In 
mind. This Is the case for the projects in the Netherlands (examination 
of radio sky maps at known star positions). In California (a parasitic 
receiver that makes a separate recording of narrowband signals, from what- 
ever the radio telescope happens to be looking at), and the recent one by 
Cohen and Nalkam (examination of data originally taken to find natural lasers). 
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The next quantwn jimp In signal sensitivity requires the use of 
receiving ^ulpment designed and constructed specifically for SETI. This 
Is analogous to building seismographs to detect earthquakes In California. 
Existing radio telescope antennas are quite adequate for the present; only 
their receivers need be changed. Fortunately, receivers are imich cheaper 
than antennas. 

Before such specialized receivers can be coretructed, they must be 
designed. Even before that, they must be conceptualized. Decisions must 
be made as to Mhat typ» of signals to expect, and optimum search algorithms 
designed to detect them. Such signals might be from beacons. At the same 
time, means must be devised to catch th(»e signals which are unpredictable, 
amorphous or strange. Such signals might be leakage. Careful searching 
takes a long time, requiring full-time operation at several (*servat1ons 
around the earth, for many years. This Is analogous to operating the seis- 
mograph for ten years. Instead of ten minutes. 

Overlaying all extraterrestrial signals are the overwhelmingly powerful 
signals being generated by our own radio and television stations. The SETI 
receiver must operate in the presence of such interference constantly. Even 

worse than the signals generated here on Earth are those transmitted by 
artificial satellites, for they look directly down the throat of the SETI 

antennas, ’he number of transmitters In operation, both on Earth and In 
space. Is growing rapidly with time. Eventually this will make It Impossible 
to conduct SETI progr«ns anywhere on the surface of the Earth. The alterna- 
tive is to construct SETI antennas In space or on the moon. This will be 
much more expensive and cwjld not occur for many decades. Thus there Is 
a great urgency to proceed now. 
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MO - The Receiving Systew 

Figure 1.1 shows conceptually the components \«^1ch make up the SETI 
receiving system. A radiotelescope containing two orthogonally polarized 
feed antennas Is connected to two separate receivers. The output of each 
receiver Is fed to two separate Hu1t1*Channe1 Spectrum Analyzers (MCSA's). 
The MCSA Is a special purpose SETI device, capable of analyzing 8 141z of 
bandwidth with a resolution of 1 Hz. The 8 Mlz Input bandwidth Is digi- 
tized and then Initially split Into 120 bands using a digital bandpass 
filter bank. These bands are then subdivided to a resolution of 1 Hz by 
120 microprocessors that continuously compute the Discrete Fourier Transform 
of the filter outputs. Both the Input and output data Is In complex form 
(I.e. , possessing both magnitude and phase). 

Each word Is 16 bits In length, but since we use only 4 as being of 
sufficient accuracy, then the output data rate 1$ 16 megabytes per second. 
This data Is fed to the Oasis Signal Detector, where It is searched for 
possible Intelligence. In actual operation, the radio telescope examines 
a target star for up to 1000 seconds. All of the data for a given target Is 
to be available for computation at one time, and a total of 16 Gigabytes 

of data Is stored digitally. 

To give perspective to the magnitude of this task, consider the 
following analogy. 16 megabytes per second Is about one encyclopedia set 
per second. These encyclopedias do not contain words and sentences, but 
are filled with random letters and numbers. As we read through these 
encyclopedias (after having taken a speed reading course), we have no Idea 
when an Intelligent message might begin (today or 5 years from now), or In 
what language It will be. To make matters worse, each encyclopedia will 
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System to Seek Extraterrestrial Radio Signals 
Figure 1.1 
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only contain one word of the inessa 9 e (average of gbytes). For example* 
the first set of encyclopedias might contain "The", and the second set 
"message", and the third "Is", and the fourth "zk3ja". As usual, just when 
you get to the good part, the data got garbled by noise, but this serves 
to Illustrate the real fact that noise will always make the signal less 
clear than It otherwise might be. 

To make life simpler, we assim« that the message will always occur In 
a c(X)s1stent volume and on a consistent page, once It gets started, for 
example. It might always be In the J volwne on page 258. Of course we don't 
know which volume or page It's on, so we still have to search everything, 
but this assumption allows us to reorganize our search. We use a special 
device called an HCSA that rips all the pages out of each en^clopedla as 
it appears, sorts them out by volume and page mnber, and rebinds them 
again (all In one second). Now we have a new set of encyclopedias to read, 
but all the page 25B's of all the J volimies now are bound together In 
sequential order. We still have just as much reading to do, but now we 
don't have to remember as much. 

Occasionally the random letters and numbers will appear to say some- 
thing purely by accident. The analogy here Is that enough monkeys at 
enough typewriters must Inevitably produce strangely real (Erases. We call 
these false alarms. They are expensive because they force us to stop 
everything and Investigate their source. We try to avoid them If possible, 
but we have to be careful that In doing so we don't miss the real signal. 

We are also bothered by occasional pages from t(« Readers Digest that 
are pasted right over some of our encyclopedia pages. They correspwwl to 
our receiver picking up terrestrial radio and television stations. TTiese 
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pages are usually e^y to spot and skip ewer* but they are distracting and 
they cover up an encyclopedia page that might have contained the message. 

The last complication Is the fact the data In reality canes from two 
antennas of different polarizations. This is analogous to splitting the 
encyclopedia In two, with all the letters in one half and all the numbers 
In the other half. We don't know If the message will be all letters, or all 
numbers, or some combination of the two. We have to read both the letters 
and nonbers simultaneously, trying to Intersperse them appre^JH ately to 
make up the entire message. 

1 .11 - Purpose of This Study 

This study is a design of that part of the specialized SETI receiver 
discussed above that actually finds and recognizes the signals. The Ideas 
and designs Incorporated In this system will undoubtedly be refined and 
modified by future workers . Eventually a system will actually be constructed. 
This Is the beginning of the next step. 

1.12 - The Oasis Signal Detection System 

Three separate signal detection philosophies are eibodled In the Oasis 
system. Illustrated conceptually In Figure 1.2. The Carrier Wave Signal 
Detector seeks the extreme case of a signal that has nearly zero bandwidth, 
and may be drifting slowly with time. The Pulse Signal Detector seeks the 
other extreme case of a signal that has a broad bandwidth, and may be pulsed 
In t1n«. The Battery of Tests seeks all those signals In between the extreme 
cases; those that are unpredlct^le. 

Each of these detection methods will be explained In detail in the 
chapters to follow. Chapter 2 discusses the philosophical differences between 
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Figure 



signals and noise. Chapter 3 lays the detailed math«nat1ca1 bases and 
philosophies of the Oasis detection algorlthns. Chapter 4 sutmarlzes the 
overall Oasis system In block form, and provides hardMare Implementation 
details. Chapter 5 focuses on the unique capi^llltles of tiie numan being 
to detect 'unexpected signals, as well as the role of the operator In the 
Oasis system. Chapter 6 discusses the radio astrononiy applications of 
the system. Chapter 7 provides 111ustrat1(Mts of non*astrophys1ca1 areas where 
the Oasis system could be put to use In solving a variety of current scienti- 
fic problems. 
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Qtapter 2 
Signals and Noise 
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Everything gets nritten down, formally, so that 
you know at all times where you are, where you've 
been, vdiere you're going and iidtere you want to get. 
In scientific work and electronics technology this 
Is necessary because otherwise the problems get so 
complex you get lost In them and confused and 
forget what you know and what you don't know and 
have to give 

—Robert M, Pirsig, Zen and the Art of Motorcycle 

Maintenance 


2.1 » Introduction 


The primary purpose of the signal detector Is to differentiate between 
signals and noise. This separation process Is based, of course, on the differing 
characteristics of signals and noise. In this chapter we explore these charac- 
teristics, and discuss how they can be used to distinguish one from other. 

The actual Implementaticm of specific algorithms to accomplish this separation 
Is discussed In Chapter 3. 


2,2 - Classifying Signals and Noise 

In this section an arbitrary classification of Inputs to the signal detector 
Is presented. The classification Is based on the origin of the Input. The Input 
will be called either signal or noise and the signal can have one of three forms 
as shown In Table 2.1. It is the purpose of the signal detector to pick out any 
of the three signals, saving It for subsequent Identification, and reject noise. 


Classification 

Si gnals 


1. 

Natural - Astronomical 

2. 

Radio Frequency Interference (RFI) 

3. 

Ikildentifled (Includes ETI) 

Noise 



Table 2.1 - Classifying Signals and Noise 
of Astronomical Interest 
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The first signal form comes from natural sources. These Include* for 
exav^le* pulsars* spectral lines frcm Interstellar clouds* the galactic 21 cm 
line* radio continuum sources* etc. Such natural signals are to be Ictentifled 
and saved as of Interest to astronomers. 

The second signal form is radio frequency Interference (RFI) which Is 
any man-made electromagnetic radiation originating from sources such as 
radio systems (both ground-based and orbiting satellites)* radar* ignition 
systems* household appliances* etc. .depending on the (4)servat1on frequency. 
These signals are not usually of direct Interest by themselves. However* 
they are in gei^ral sufficiently different from noise that they will be 
picked out by the signal detector. In many situations It will be desirable 
to catalog such signals so that later Inputs of the same form can be 
recognized as RFI by the signal Identification stage. 

Unidentified signals are simply Inputs which appear to be different from 
noise but which cannot be confidently Identified as either astronomical 
or RFI. They may In fact be one of the two, or they may originate from an 
Intelligent source beyond the solar system. 

Noise Is defined In Section 2.4 below. 

2.3 - A Global View 

In this section we consider the environment In which signals are generated 
and noise Introduced, and how a system to distinguish between the two Is 
structured. Figure 2.1 shows a conceptual block diagram of a con 4 >lete 
communications system. 
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Message Source 1 Channel ^ RCVR/MCSA 



Figure 2.1 - Generalized Communication System 






The message source block may represent an Ignition system In a nearby 
automobile, a pulsar or hydrogen cloud someMhere In the Nllky Way, or 
perhaps an Intelligent civilization many light years from earth. 

Hie channel represents the medium through which the signal Is trans* 
mitted. It may be a few miles of atmosphere for a nearby RFI, or scores 
of light years with components sudt as atmospheres and Ionospheres, Inter- 
stellar clouds, and nearly empty space. Some of these transmission media 
can have important effects on the signal as It travels between message 
source and receiver. 

All the noise Is assuned to be added to the signal at the receiver. In 
fact the noise may have come from far out in space, or may have originated 
In the receiver itself. It is assumed in this study that the receiver and 
the NCSA already exist. They are not of direct interest to our design except 
for some control functions associated with the best operation of the signal 
detector. 

The signal detector is the focus of this design project. It nas the 
task of detecting any signals which exist in the expanse of noise which 
permeates the electromagnetic envirormient. Its design is dictated by the 
assimv>tions which are made about noise and about the various signals. In a 
classic communication system it is usually assumed that the general character 
and the statistics of signals and noise are known, and are used to determine 
the nature of the signal detector. In our case nothing is known about any 
potential ETI signals, and there Is uncertainty about the exact nature of the 
other signals and the noise which will be received. For this reason the 
lines entering the "assunptions" block are shown as dashed. Despite this 
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uncertainty* assumptions must be made because they are an essential pre- 
reHulstte to the design of the signal detector. These assunqjtlons are 
presented In Sections 2.4 and 2.5. 

The output of the signal detector gMS to a signal Identifier which lias 
the task of (tetermlning whether the signal Is natural, RFI* or unidentified. 
This block Is not a primary focus of the present study although It does 
receive some attention. The signal Identifier directs the signal to the 
appropriate user. 

Unidentified signals are stored temporarily or longer for attempted 
identification by the operator, with the aid of a separate RFI detection 
and Identification systems and various other tools available to the operator. 

Finally, of course, we shall want to decode or interpret any ETI 
signals which are obtained. This problem has been addressed briefly else- 
where (1), and Is beyond the scope of this study. If an ETI signal was 
ever found, resources far beyond those available for the search itself would 
be brought to bear on Its Interpretation. 

2.4 - What Is Noise? 

The term "noise" is not uniquely or unambiguously defined. Sometimes It 
refers to any undesired Input. In our case such a definition could be ambi- 
guous. For example, RFI Is not of Interest or desired for SETI purposes and 
yet we tmjst Initially detect It as a potential ETI signal, since It has many 
of the same characteristics. Once Identified as RFI, we catalog such a signal 
so as to reject recurrent similar RFI by means of comparison tests. Likewise, 
astronomical signals are not desired for SETI purposes, but are sent to the 
astronomical observer for other uses. 
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For our purposes we define noise In terms of Its statistical charac- 
teristics, We then define a signal as ar\y Input whose characteristics 
differ significantly from those of noise. 

There are many sources of noise, with many different statistical 
properties. In the electromagnetic envlroiwent. However, In the part of the 
spectriOT of Interest to us here, the noise Is primarily thermal noise 
generated by the receiver a">d the Earth's atmosphere. We assume that the 
noise in the system is Gaussian in distribution, with zero mean and 
variance o^. We further assume that the noise power is uniformly distri- 
buted over the spectral range of interest, with a power spectral density 
equal to kT, where k is Boltzman's constant and T is the equivalent system 
noise temperature. Hence the noise power is kTB ( * o^) where B is the 
bandwidth of the portion of the spectrum under study. Finally we assume 
that the noise is unpolarized. 

2.5 - Signal Characteristics 

The distinguishing characteristics which are used to identify an input 
as signal are its: 

a. amplitude 

b. periodicity 

c. polarization 

d. bandwidth 

The amplitude of a signal might be used to distinguish it from noise in 
two ways. First, the average power or an^litude of the signal might greatly 
exceed that of the noise (a high signal to noise ratio). Then a threshold 
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detector cen Indicate a signal whenever sane arbitrary threshold Is 
exceeded. An alternative signaling scheme Is one In MhlOh the average 
signal power Is not necessarily greater than the noise power but the signal 
power Is concentrated Into a short time pulse. Such a signal could also be 
detected by an amplitude threshold device. Of c(xirse, Gaussian noise, 
since It has a theoretically Infinite amiplltude range, can also trigger 
such an amplitude detector, although with low probability. The detection 
process is Inherently probabilistic. A detector output caused by noise 
is a false detection of a "false alarm." 

One of the most Important characteristics of signals Is periodicity, that 
Is their tendency to repeat their form over constant intervals of time. 
Actually, this periodicity can manifest itself in a number of ways. Many 
signals have a so-called "carrier" component which is a sinusoidal signal 
with a well-defined period. Another possible periodicity Is a sequence of 
repeated pulses occurringat some fixed spacing. While it is possible to 
design a communication system with no periodicity whatsoever, such signals 
begin to resemble noise and would be very difficult to detect without 
considerable advance knowledge of their structure. One of the reasons that 
periodicity Is of such great Interest to us is that few natural signals 
exhibit a high degree of periodicity. (Pulsars are a counter-example. ) 

Signals can never be exactly periodic though they may be so close to 
periodic that we have no interest in the deviations. We are also Interested 
in signals which have relatively small deviations from periodicity as Is 
true of slowly drifting sinusoids or carriers with vei7 little modulation. 
Such signals are often called narrowband in that they occupy not a single 
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line In the spectrum, but rather a fairly small range of the spectrum. 
There are a ntmiber of ways of detecting periodicity In an Input. One 
Is by means of a filter, which might be mechanical, or electrical, or of 
SOTO other form. If the input Is a sinusoid or has sinusoidal components 
with a frequency near the resonant frequency of the filter, this Input or 
components of the Input are passed through the filter. All other signals 
and noise with frequency components not close to this resonant frequency 
are "filtered out." 

A mathematical approach to Isolating or detecting sinusoids Is the 
Fourier Transform, which has the form; 

X(f) » £ x(t)e‘^^^^^dt (2-1) 

Suppose that the input x(t) is a single periodic signal of the form: 

x{t) = (2-2) 


If f = f j In (2-1) the exponents cancel so that the only integrand is the 
constant A, leading to an unbounded Integral. If f f fj the exponential 
function rotates, such that its integral is essentially zero. Hence the 
effect of the Fourier Transform is to pick out the signal with frequency f^ 
and reject all other signals and noise. In our system the signal filtering 
process is carried out by the ICSA which is programmed to execute a Discrete 
Fourier Transform (DFT) of the form: 


, N-1 

^n “ H ^ \ ^ 

n n K 


-j2Trnk/N 


n * 0,1,2,.... ,N-1 


where the Xj^ are N samples of the time function x(t) over a total record 
(sampling) period of T seconds. 
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The response of the DFT to both signals and noise Is discussed In 
Appendix A2.1 In some detail. As Equation (A2.1-20) and Figure A2.1.3 show, the 

response of the DFT to a sinusoid across ar^ one channel has a finite width, 
fo11(Mlng a sine function law. Hence the NCSA acts like a set of very 
narrowband filters, which reject signals or noise compements outside of a 
bandwidth of about one Hertz In our case. 

Sinusoidal periodicities can also be detected by phase«1ock loops which 
act as a filter with a variable resonant frequency. The loop attempts to 
adjust Its center frequency and remain locked to this frequency as long as 
the Input frequency remains fixed. The loop can follow drifts or shifts In 
the Input frequency, and also angular modulation of the Input, as long as 
the drift rate or modulation rate does not exceed the capacity of the loop. 

The periodicity may also appear in the form of repeated pulses, sudi as 
in Figure 2.2a. The Fourier series coefficients for such a signal vary as 
shown in Figure 2.2b. The Important point here is that there are strong 
frequency components out to about 1/t Hz, where t is the pulse width. That 
is, the narrower the pulse, the more frequency content there Is in the signal. 

If, for example, the pulse width were about 10 msec, the power In the signal 
would be spread over about 100 Hz, that Is over 100 channels of the MCSA 
operating with 1 Hz channels. Hence such pulses can be sought by observing 
many diannels at once. In Figure 2.3 we show how a drifting carrier, a pulsed 
drifting carrier, and a sinusoidal drifting carrier might appear In the noisy 
output of the MCSA. 

The final signal characteristic which we consider is the polarization. 

We assume here that ETI signals are strongly polarized and noise unpolarized. 
Hence the presence of unusually high power In an Input which lacks polariza- 
tion tends to suggest an unusually high noise peak. On the other hand, some 
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Figure 2.2. Repeated Pulse Signal 


oruG'.T'J^.t P'Gn IS 

OF QUALITY 


Figure 2.3 — Simulated signals in noisa. Each display below represents a set of 
vertically stacked frequency spectra. Each spectrum has been taken at sequential 
moments in time. Brightness indicates detected powei at a frequency. Ti>e 
displays are dominated by noisa, while the linear features indicate the detection 
of carrier waves. 


(a) Carrier wave drifting in frequency 
at a constant rate in time 



(b) Pulsed drifting carrier wave 




l»lar1zation In an Input which does not have high power tends to suggest 

the presence of a relatively vwak signal In noise. 

In this section we have consictered three major characteristics of 

signals; amplitude, frequency, and polarization. Next we use these 

characteristics to define what kind of signals we will attempt to detect. 

2.6 « Signal Type Assuiwtlons 

The signal detector Is designed to detect five different general 
classes of signals. 

1. Monochromatic (Sinusoidal) 

2. Drifting Sinusoid 

3. Itepeated Pulses 

4. Narrowband Signals 

5. Patterned Signals 

These classes are not all mutually exclusive, but the classification 
Is valuable as a guide In setting up signal tests or algorithms. Some 
tests may detect more than one class of signal. 

Monochromatic or slngle-frequency sinusoidal signals are a very strong 
possibility In the SETI environment. Such signals could be used as a beacon 
to attract attention. Or they might be carriers from a radio signal, 
observed In an eavesdropping mode (2). Finally, they might be very slowly 
drifting signals with a frequency which appears to be constant over the 
observation time. In any event such signals would be of great Interest and 
would require attention. 

A drifting sinusoid could easily arise from any of the mechanisms in 
the above paragraph with the adcted effect of a change In the doppler shift 
due to planetary motion (ours and theirs). Because such an effect Is 
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expectad, and because the sinusoid Is relatively simple to transmit and 
detect, this signal Is given perhaps the highest a priori prob^lllty of 
existence, and hence a substwitlal effort Is made to detect It. 

R^>eated pulses, modulating a carrier In the spectral search range, 
have the ^orm shown In Figure 2.2a. They could arise for at least two 
reasons. First, an ETI might reason that Instead of transmitting a sinu- 
soid at some given pomrer level it might be better to transit a higher 
burst or pulse of power for a short time, and then turn the transmitter off 
between pulse repetitions. A peak power detector would have an easier 
time observing such a signal than a sinusoid. A second way In which this 
signal might arise is from a slmisol dal -signal beacon which is swept across 
the sl^ in an attempt to attract attention over a wider range than a 
focieed beam. Such a swept beacon would appear to any one observer as a 
pulsed signal. 

Narrowband signals have their power concentrated over a relatively 
narrow rang of the frequency spectrum. They could arise from monochromatic 
signals which drift, or which experience a small amount of dispersion, or 
which have modulation with a very low index. 

By **pattemed signals" we mean those which possess some degree or 
organization or pattern which might be recognized as too well-structured to 
have come from a random process. Accordingly, one of the algorithms to bo 
discussed later does search for clusters of signal characteristics, which 
can be viewed as a kind of pattern. In addition one of the primary reasons 
for introducing the human observer Into the signal detection process is to 
take advantage of the capacity of the human to detect or recognize certain 
types of patterns. 
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2.7 - Review of Signal Processing Steps 

As we In Section 2.3 there are three steps or stages In the process 
of searching for and understanding Intelligent signals from space. These 
are: 

1. Signal Detecticm 

2. Signal Identification 

3. Signal Decoding 

Signal detection, the central focus of this study. Is the process of 
distinguishing signals from background noise. In the next section we 
explore some theoretical considerations relating to this step. In Chapter 3 
m present some specific signal detection algorltNns Implemented In this 
design. 

Once we have detected a signal we turn to the problem of classifying 
it as: astronomical, RFI, or unidentified (Including ETI). If a signal Is 

Identified as astronomical. It Is sent to the deserving astronomer for 
further study. If a signal Is known to be RFI, It can be classified and 
cataloged, possibly to help with later Identification of other RFI. If a 
signal cannot be Identified as either astronomical or RFI, It Is stored until 
such time as further Identification efforts can be made. This recording or 
archiving can be done automatically or under operator discretion. 

If a signal Is believed to originate from an ETI, It must be Interpreted 
or decoded by methods and algorithms yet to be devised. 

2.8 - Signal Detection Algorithms 

In this section we consider the general types of approaches or algorithms 
which can be applied to the problem of detecting signals In noise. Specific 
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al9>r1t)m selected for our designs ere discussed In Chapter 3* We 
consider six approaches. 

1. Data aggregation 

2. Data addition 

3. Search for sinusolcfe 

4. Search for drifting sinusoids 

5. Search for polarization 

6. Pattern recognition 

Data aggregation Is a process of cond>1n1ng In any one of a nuxter of 
ways the new data. A comnon reason for doing this Is to reduce the amount 
of data which must be processed In a given aiMMJnt of time. A second 
reasem for aggregating data Is to include in one value the effects of more 
than one signal property. For example, one term might be used to measure 
both power and polarization. If we wished only to pay attention to signals 
In which both power and polarizations exist, then a multiplicative aggregate 
term could be devised. A disadvantage of aggregation Is that It sacrifices 
data. This effect may be overcome by storing the raw data for some time. 

I^ta atMItlon Is a form of aggregation which usually Is In^lemented 
with the special motivation of Increasing the signal to noise ratio. Data 
values may be added over successive fCSA bins. In which case we call this 
coarse-binning. Data can also be added for the sam bln or channel over a 
number of successive time records. The addition can also be performed 
over any other combinations of data values. 

teta addition Increases the signal to noise ratio by taking advantage of 
the fact that the data tends to add coherently while the noise adds 1ncoher> 
ently. Data additicm can be done over the airplltudes of the signals or over 
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the poMer (the sum of the squares of the real and Imaginary components). 

It may be undesirable In some cases to add amplitudes. If for exnnple 
these amplitudes are complex and the MCSA has Introduced a phase shift 
which permits amplitudes to subtract. 

There are a number of ways to search for sinusoids by taking advantage 
of their periodic nature. Most of these approaches are equivalent to 
narrowband filtering, as discussed In Section 2.5 above. One basic approach 
In our situation Is to use the MCSA as a narroMband filter and then sum the 
output of each bln over many time records. 

If the sinusoid Is drifting (dinging frequency slowly) we cannot add 
bin outputs since the signal will drift out of cme bln Into the next as 
time goes on. Two approaches are possible. The first Is to force the center 
frequency of a channel or bln to track or follow a drifting Input. Such a 
system would be In essence a phase-locked loop. A second approach Is to 
add the signal over a set of channels which are different for each time 
period, and In this way follow or track the signal. The latter approach 
was Implemented (see Chapter 3) in tbe Carrier Wave Detector. 

The search for polarization must begin at the receiving antenna which 
has two probes, each of which Is sensitive to one of two orthogonal polari- 
zations. The recelver/MCSA system carries the two resulting signals along 
in parallel channels to the Signal Detector, where they are then used to 
obtain a measure of the degree of polarization of the Input. In our Imple- 
mentation this data Is used In three of the nine MBIT tests. 

Pattern recognition Is a very general term which can mean mapy different 
things. It refers to ways In which signals are grouped or organized In some 
sense. In the design which follows, the Cluster Detector Is a form of pattern 
recognition device. 
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Taking Three as the subject to reason about— 

A convenient number to state— 

We add Seven, and Ten, and then multiply out 
By One Thousand diminished by Eight. 

The Result we proceed to divide, as you see. 

By Nine Hundred and Ninety and Two. 

Then subtract Seventeen and the answer must be 
Exactly and perfectly tri«. 

The method employed I would gladly explain, 
while I have it so clear in ny head. 

If I had but the time and you had but the brain 
But much remains yet to be said. 

— Lewis Carroll, The Hunting of the Snark 


3.1 - Purpose of This Chapter 

This chapter will Introduce the Oasis Signal Detector (OSD), explain 
its role in a search for Intelligent signals, and develop, in some detail, 
the algorithms that this device employs as it shifts and sorts through giga- 
bits of data. 

As mentioned in Chapter 2, the OSD examines the output of the multi- 
diannel spectrim analyzer, in order to recognize signals therein, and, subject 
to the control of the operator, determines the nature of the signals as one 
of a) RFI (radio frequency Interference) 

b) natural astronomical bodies 

c) ETI (extraterrestrial transmissions). 

The OSD also records Interesting portions of the input data (selected by itself 
and/or the operator, at the operator's discretion), and at all times, monitors 
the entire SETI search system. 
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3.2 - Overview 


Real Time Processing 

The OSD Is a real time Instrument, that is, it recognizes a signal, if 
any is present, during the time the antenna is pointed at a particular 
target, or shortly (16 minutes) thereafter. Once a signal is detected, 
identification tests are made with the aid of the operator and the OSD's 
RFI detection system, as well as the radio astrononiy observer at his own 
output station, using a computerized catalog of radio sources. This rapid 
performance of identification tests eliminates costly and time-consuming 
reobservation of targets, and allows for the detection of transient types of 
signals of any of the three types. Reobservation will therefore occur only 
when there is a high probability that a detection was not a false alarm. 

The length of observation of a target is nominally set at 1000 seconds 
(16.6 minutes), but since the desired length of time may vary appreciably, 
the system is configured to handle observations of any multiple of 20 
seconds in length. 

Signal Recognition Procedure 

The niBnber of known signal modulation schemes is arbitrarily large. 

While it was felt important to design the OSD to be able to detect as mariy 
different signal forms as possible, we also felt that the OSD should be highly 
sensitive to those types of signals deemed optimal as interstellar beacons, as 
well as to high power signals leaked into space from the Earth. The most 
dominant components of these signals are very narrowband polarized carriers and 
pulsed RF signals, which to a distant observer may appear to be drifting slowly 
in frequency. To respond to these pulses and carrier waves, two special purpose 
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detectors were (tesigned. They are called the pulse detector (PD) and the 
carrier wave detector (CMD). They are designed to be as sensitive as possible, 
with an enterir^ data point Integrated Into many different sims and products, 
each of which represents a different type of possible drifting carrier or pulse. 

Since artificially generated carrier waves are often constant In amplitude 
from second to second, relatively stationary In frequency, and always 100% 
polarized, a detection method highly sensitive to signals with such attributes 
was developed. This method (or measure), called Generalized Coherence, Is 
described fully in the latter part of this chapter, and Is employed within the 
OiD algorithm. Thus the OSD Is highly alert to the presence of polarized carrier 
waves, even If they are Doppler- drifting. 

Beyond providing signal-specific algorithms, the CSD has the facility to 
detect signals of an arbitrary and unknown nature. This Is accomplished by a 
two-stage processor called the Numerical Battery of Independent Tests (fffilT). 

The first stage analyzes data from the NCSA In small parcels, and reports to 
the second stage a suninary of the attributes of the data In each parcel. To 
this end, each parcel (called a block) undergoes a battery of tests. If any 
block scores outstandingly in any of the tests (which examine total power, 
polarization, and the presence of narrowband wplltude spikes, and more), the 
operator Is alerted. In addition, a second stage always examines the report 
of the first stage (even In the absence of outstanding features). The second 
stage looks for the presence of trends, or patterns, such as the existence of 
blocks adjacent In time or frequency which possess similar attributes. This 
second stage Is called the Cluster Detector (CD). 

The terms PD, CWD, NBIT, and CD represent both the algorithms and the 
hardware processors that perform these operations. These four processors comprise 
the computational part of the OSD, called the realtime processing system, and are 
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described mathematically In this chapter, and as hardware devices In the next 
chapter. The realtime processing system Is shown In Figure 3.1, 

An "alarm" Is the term used In this report to Indicate an Internal signal 
or flag from a processor to the hwnan Interface system describing the detection 
of a signal In noise. An alarm contains ttie Information; What (how mud)), vdten, 
and where (frequency). 

Human Interface 

The operator Interface system consists of the operator, a central con^uter 
(which allows the operator the access to converse rapidly with the system), 
interactive graphic displays, a small archive in which either he or the central 
computer may decide to retain data segments of special interest, and a package 
of software routines which allow him to explore the data in his own way. 

Most importantly, the operator is given access to the total observation 
archive which contains temporarily all of the data collected on the current 
target. He Is shown the alarms as they come In, and is free to either let Vtte 
system decide that which Is Important enough to save or else to make the 
decision himself. Providing aid in this task is the RFI detector, and the 
Radio Astronomy System, since each unit is capable of automatically Informing 
the central computer that certain alarms are Identifiable as non-ETI. Those 
signals of astrophysical Interest will be processed and recorded independently 
by the Radio Astronomy System. The Himian Interface System and the Radio Astro- 
nomy System are described In Chapters 5 and 6 respectively. 
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Figure 3.1 The Oasis Signal Detector 








3.3 - Generalized C(tfterence 


Background 

It is widely believed that intelligent signals will be coherent with 
respect to both polarization and time. It is therefore desirable that the 
receiving system measure these two signal characteristics and use them to 
discriminate agaif?st sign5‘’s which are not coherent, resulting in a 
decrease in sensitivity to incoherent signals. Since noise is incoherent, 
the discrimination against noise results in a substantial increase in 
sensitivity to coherent signals. 

Polarization coherence and time coherence are two independent 
characteristics, and can be measured for any received signal. Polarization 
coherence is a measure of the consistency of the anplitude and phase 
between the signals received by two orthogonally polarized antennas. Time 
coherence is a measure of the consistency of the amplitude and phase of a 
single signal, with respect to a time-delayed version of itself. Although 
these two types of coherence are determined in quite different ways, it 
will be shown below that when both have been taken into account they lose 
their individual identities and merge into an overall (Generalized Coherence . 
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Polarization Matching 


The receiving system contains t»<«) inctependent receivers, connected 
to orthogonally polarized feed antennas. Typically, the feed antennas 
will have horizontal and vertical, or left and right circular polarizations. 
It would be possible to process the data from each antenna separately, but 
this is undesirable because it essentially doubles the cost of data 
processing. A preferable procedure would be to confine the two antenna 
outputs by some method, and then process the resulting single signal, 
provided that this can be done without a significant decrease in 
sensitivity. One might even suspect that if the two antenna outputs were 
confined in some optimum way, the sensitivity would be further increased. 


Measurino the Polarization of a Received Signal - In order to examine 
various methods of handling the two output signals, some basic polarization 
theory is required. Suppose the feed antennas are horizontally and 
vertically polarized, and denote their complex output voltages by x and y. 

This notation will be used throughout this report. (The choices of horizontal 


and vertical, as opposed to left and right circular, is arbitrary but generally 
done in polarization discussions. It will make no difference in the final 
result.) The type of polarization of the incoming wave is completely determined 
by the ratio of the magnitudes of the two outputs, and by their phase angle 
difference (Kraus, 1966, and Cohen, 1958) and is conmonly expressed as two angles; 

« “ -180° < 6 < +180° (3-1) 


tan 


-1 


0 < Y 1 90" (3-2) 

which can be plotted on a spherical surface known as the Poincare sphere 
(see Figure 3.2). 
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Figure 3.2 - The Poincarl Sphere 




Having measured 6 and y for any incoming signal, its polarization type 
may be plotted at the appropriate point M on the Poincar€ sphere. For 
example, suppose the two outputs have equal magnitudes and differ in phase 
by 90°. Then 6 = 90° and y = tan“H = 45°. Plotting this on the 
Poincarfe sphere, we see that the received signal is left circularly 
polarized. 


Setting the Polarization of the Receiving Antenna to Match That of the 

Signal - Given two orthogonally polarized antennas, it is oossible to combine 
their outputs in such a way as to syntnesize a single antenna of any 

desired type of polarization. This is the inverse of the measurement 
problem, and may be stated in this way: Given the desired type of 
polarization, as specified by the angles 6 and y ' what are the 
weighting factors needed to combine the two antenna outputs in the form 
Z = K, X + K 2 y (3-3) 

in such a way as to optimize reception for that type of polarization? 


These weighting factors are derived in Appendix 3.1, giving the 


relation 


7 s 


X cos Y + y sin Y 


(3-4) 


When searching for signals whose polarization type is unknown, a 
possible optimum method is to use the signal itself to control the x and 
y combining process, thereby adaptively matching the polarization type of 
the antenna to that of the incoming signal. This may be done by i I *st 
measuring the type of polarization {y and 6), and then synthesizing the 
antenna that matches that polarization. This combined operation can be 
carried out in real time, using the single relation (developed in Appendix A3.2), 
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(3-5) 



Comparison with Alternative Polarization-Handling Methods - One 
simple method for dealing with signal polarization is to add the powers 
coming out of the two antennas ignoring all polarization considerations. It 
is a fact that the total signal power will always be otained by doing this, 
regardless of the signal or receiving antenna polarization. However, the 
fact that two independent noise powers are added always decreases the SNR by 
a factor of where the polarization matching method weights the two so 
that the loss ranges from 0 to Another disadvantage is that the signal 

cannot be tested for tinw coherence at a later stage in the analysis, if 
power is calculated at this point. 

A second simple method is to add the amplitudes. This corresponds to 
setting K^=I<2=1 in equation ( 3-3 ) above and means in reality that we are 
receiving at a single polarization (linear at 45^, assuming x and y 
antennas). Any other polarization will suffer some loss, ranging up to 
infinite for the opposite case (linear at 135*^, for x and y antennas). 

The two polarizations could also be processed completely independently. 
This means that we now are receiving with two polarizations (corresponding 
to whatever type of antennas are being used). Other polarizations suffer 
some loss, up to 3 db for those halfway between the two in use. (e.g. - if 
X and y antennas are used, then left and right hand circular both have 
3 db loss. ) 
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By combining the antenna outputs In various ways (I.e., chocking 

Kj and Kg), ai\y nun^r of specific receiving polarizations can be 

synthesized. In Project Cyclops It was suggested that 4 or 6 cardinally 

chosen polarizations be created In this way, and processed separately. This 

reduces the possible loss, since a signal polarization must then fall 

closer on the average to one of the receiver polarizations. The disadvantage of 
this Is that the data processing requirements Increase in direct 

proportion to the number of polarizations. 

All of these various methods are summarized in Table 3.1. The 

average losses were calculated assuming the signal polarization types 

are uniformly distributed around the Poincare sphere. 


NunA>er of 
Polarizations 
Method Searched 

Minimum 
Loss (db) 

Maximtmi Average 
Loss (db) Loss (db) 

Relative 
Amount of 
Data Processing 
Required 

Add Powers of 
the 2 Antennas 

0 

1.5 

1.5 

1.5 

1 

Add An^litudes of 
the 2 Antennas 

1 

0 

oo 

5.1 

1 

Process 2 

Antennas 

Separately 

2 

0 

3.0 

1.3 

2 

Synthesize 4 
Polarizations 
and Process 
Separately 

4 

0 

3.0 

0.7 

4 

Synthesize 6 
Polarizations 
and Process 
Separately 

6 

0 

1.0 

0.4 

6 

Signal -Matching 

00 

0 

0 

0 

1 


Table 3.1 - Comparison of different polarization 
processing methods 
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It is in^rtant that the polarization synthesis not be done in the RF 
or IF portions of the receivers, as is commonly done. To (k) so means that 
all signals at any instant of time will be received with the same 
polarization. On the other hand, if the polarization synthesis is done 
as part of the data processing (as in the signal -matching method), then 
each signal can be received with a di f ferent polarization. 

Polarization Coherence 

Partial Polarization - It is not necessary for real-world signals to be 
polarized at all. Most natural radio sources radiate unpolarized signals. 
Some natural radio sources are partially polarized, radiating signals 
containing both polarized and unpolarized components. Any signal radiated 
from a single antenna, however, is always completely polarized. We use this 
fact to great advantage. 

The power received by an antenna of polarization type M in response 
to a signal of polarization type M' is 

Pr = P 5 Q/2 + m ( cos" ^ - 1/^ (3-6) 

where P^ is the signal power 

m is the degree of polarization; 0 £ m £ 1 

MM' is the polarization mismatch angle between the signal and antenna, 
as plotted on the Poincare sphere; 0 £ HM' £ ir 
When the signal is completely unpolarized (i.e., m = 0), the received power 
is one-half that of the signal power, regardless of what receiving antenna 
polarization is used. When the signal is completely polarized (i.e., m * 1), 
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the received power lies between zero and ttie signal power, depending on 
tite polarization mismatch angle. 

The degree of polarization Is (tefined in Appendix 3.2 as 

m - /i , 

¥ ()oF + xy*}* 

The degree of polarization may be regarded as the radius of the 
Poincare sphere. A con^letely polarized signal has radius of one {1.e.» the 
unit sphere), whereas a completely unpolarized signal has a radius of zero 
(i.e., a dot at the center). This graphical representation serves to 
illustrate the fact that the type of polarization ( location on the surface 
of the Poincare sphere) becomes less and less Important as the degree of 
polarization ( radius of the Poincare sphere) becomes smaller and smaller. 

In the extreme case of an unpolarized signal, the type of polarization 
ceases to exist, as exemplified by the remaining dot at the center. 

Polarized Signal Component Extraction - When the receiving antenna 
polarization Is matched to that of the signal (e.g., using the polarization 
matching method described above), equation (3-6) becomes, 

= Pj ( 1/2 + m/2 ). 

This may be rewritten as 

Pr = Ps ( --f — + "> )• 

Here the first term In the brackets is the unpolarized component of the 
signal and the second term is the polarized component. Since only half of 
the unpolarized con^jonent Is received, a polarization-matched antenna acts 
as a filter, leaving the polarized component unchanged. If the powers In 
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two antennas were simply added, all of the unpolarized power would be 
recely^d thereby decreasing the SNR by a factor of 2, neglecting receiver 
noise. 

Since we are seeking signals which are polarized, only the second 
term Is of Interest. We wish to extract that fractiwi of the received 
power that is polarized, I.e., 


P 


rp 





+ m) 




2m 

1+m 


) 


(3-8) 


If we were concerned only with signal power at this point we would 
just compute the desired result according to equation (3-8). To do so, 
however, would prevent later examination of the time cc^erence properties 
of the signal. Instead, we multiply the complex signal z of equation (3-5) 
by the square root of the fraction given In (3-8) (since z Is an 
anplltude, as opposed to power). This extracts the polarized component of 
the composite signal z, I.e., 



The effects of noise on this calculation are given In Appendix A3. 3. 

An alternative to extracting the polarized signal exponent Is to 
Ignore the degreee of polarization. The disadvantage of this Is that no 
discrimination Is matte against unpolarized signals. Two signals of equal 
power, one polarized and the other unpolarized, would give the same receiver 
output. If the degree of polarization Is used as described above, the 
polarized signal would be unaffected, whereas the unpolarized signal would 
be eliminated from the receiver output. 
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Time Coherence 

The complex degree of self-coherence of a signal z{t) is defined as 

y( t) * 

This is in fact the same as the complex normalized autocorrelation function 
of the signal. The magnitude of this quantity is known as the Degree of 
Self Coherence, and 0 < |u( t)| < 1. The denominator of \ i { t) is the 
power of the signal, so tnat 

|u( t)| = 

It should be noted that the magnitude of u depends cn both the phase 
and amplitude of the original signal. It is not correct to say that the 
magnitude of u depends only on the amplitude of the original signal, or 
that the phase angle of u depends only on the phase angle of the signal. 
Thus Iv*! is a measure of both amplitude coherence and phase coherence in 
the signal. 

We are Interested in only that part of the received signal and noise 
combination that has time coherence. That component may be extracted by 
multiplying the received power by the degree of self- coherence, i.e., 

* Coherent Power » |u(t) | P^ = | z{t + t) z(t)* j 

The time lag t used above may be regarded as the coherence time of 
the power ireasurement. It must be sufficiently long that the noise 
component shows no coherence, but no longer than necessary, to allow for 
slow signal variations. The MCSA provides independent outputs at one 
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second intervals, so x must be an Integral number of seconds. A lag of 
zero will cause the noise to appear coherent, and is therefore too small. 
A lag of one or more will make the noise independent from point to point. 
Therefore we choose a lag of one, as being long enough to discriminate 
against noise, yet short enough to allow slow signal variations. Thus, 

Pp = Coherent Power = | z(t + 1) z(t)* | 


Coherent power has the useful property that it is unaffected by any 
phase shifts that are linear functions of time (see Appendix A3. 4). This is 
crucial for the current application because the NCSA introduces a linear 
phase shift into the signal that is a function of the unknown position of 
the signal frequency within a frequency bin. Another way of saying this is 
that the complex output voltage of an MCSA channel that contains a non- 
drifting sinusoidal signal, is another sinusoidal signal. The frequency 
of that output signal is the difference between the center frequency of 
the bin and the original signal frequency. Thus for 1 Hz bins, the output 
frequency lies in the range of ±1/2 Hz. Then if the amplitude of this 
sinusoid is A, we have from Appendix A3. 4 that 


P 


c 


1 

mr ^^+l 



t 


so that for a sinusoidal signal, the coherent power is identical to the signal 
power. 

Consider now an incoherent signal (such as a natural radio source), whose 
power is equal to that of the above sinusoidal signal. The MCSA output will 
be a voltage whose amplitude and phase are random variables, and whose spectnmi 
is band limited (±1/2 Hz) white noise. The coherent pov«r will be the same 
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as the signal power for the first data point, and will decrease as as 
additional data points are included in the measuretnent. Thus coherent power 
has an advantage over signal power in that it distinguishes between coherent 
and incoherent signals. 


Generalized Coherence 

Formulation - In the preceding sections, specific operations were carried 
out on the received signal, taking into account successively the type of polar- 
ization, the degree of polarization, and the time coherence. These operations 
are shown schematically in Figure 3.3. If we now contine these three 
operations into a single expression, and label the resulting quantity as 
Generalized Coherence Value (GCV), we obtain (conceptually) (see Appendix A3. 5 
for details) 


GCV 

where 


f «xn <« * '■xyn 


fyx (■) * V ♦ "yn "yy ' 


6 = tan 


-1 ^ 


Re( R^y(O) ) 


F = 



1 — 


h 

_Rxx(°) - "yyfO) 

* + 4 R^y(O) 


R,,(0) . Ryy(O) 


is the normalized power in antenna i 
Pijn the normalized cross power between antennas i and j 
R^j is the correlation function between the and antenna outputs. 
In essence, all the possible auto- and cross-correlation functions, both 
time shifted and unshifted, appear in the GCV expression, in a symmetrical way. 
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Polartzition 1 


Polarization 2 


Signal- Directed 
Polarization Type 
Matchi ng 



Extraction of 
Polarization- Coherent 
Signal Component 


Extraction of 
Time- Coherent 
Power exponent 


Generalized Coherence Value 


Figure 3.3- Qperatiwis Involved in Calculation of the Generalized 

Coherence Value 
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It has been demonstrated previously (Born and Wolfe, 1970) that the 
coherence of a signal ob^s the electromagnetic wave equation, and propagates 
through space In the sane manner as the signal Itself. This fact lends addi- 
tional support to the Idea of coherence being In some sense a fundamental 
quality of the signal. 

Real-Time Evaluation - Calculation of generalized cc^erence Involves cal- 
culating the correlation functions described above. All these quantities are 
summations of combinations of the Instantaneous outputs of the two antennas. 

In a realtime computing situation, the sinvnatlons can be calculated continuously 
as the data points become available, providing the partial results of the auto- 
correlation function are accumulated. When the desired anount of data has been 
accumulated, the summations may be combined by relatively simple computations to 
give the generalized coherence value. 

When very many outputs (channels) are monitored for generalized coherence 
concurrently, the retention of the partial results may require substantial over- 
head in terms of storage. An alternative scheme developed In Chapter 4 enables 
the serial evaluation of channel outputs, and thus eliminates the need for 
partial result accumulations. 

Performance - Noise causes the computed values of the correlation func- 
tions described above to have statistical fluctuations. In addition. If the 
correlation functions are calculated using too few data points, they 

will be biased toward the high side (as are all correlation functions). 

In order to quantify the statistical characteristics of GCV, computer 

simulations were run for a number of signal-to-noise ratios. 

Figure 3.4 is a typical example of the simulations made to investigate 
GCV. The data from all such simulations is gathered together in Figure 3.5. 
Details are given in Appendix A3. 6. 
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It is possible that GCV should be squared for optimum detection. This is 
suggested by the fact that its probability density is similar to that of a sinu- 
soidal signal in noise, just prior to square law detection in a normal receiver, 
and because GCV is fundamentally the magnitude of a vector even though it has 
units of power, GCV is the sum of many vectors, analogously to the way that a 
traditional coherent detector adds up many vectors prior to squaring their sum. 

Inspection of Figure 3.5 reveals that below a SNR of 0.6, the power calcu- 
lation is more sensitive, and above 0,6 GCV is more sensitive, and becomes 
increasingly more so as SNR increases. This advantage will be manifested as 
higher detection probabilities and lower false alarm rates. The small amount of 
data shown for the square of GCV reveals an even greater advantage. The power 
sensitivity crossover is then 0.4, and the output SNR curve climbs more rapidly. 
Actually, the relative performance of GCV vs. power is significantly better 
than the curves indicate since the power curves assume the signal polarization 
is known beforehand and exactly matched to a single chosen antenna, with the 
other antenna and its accompanying receptiver noise being ignored. The GCV 
curves, in contrast, assume no prior knowledge of the signal polarization and 
include the effects of noise from both receivers. It is believed that future 
studies comparing randomly mismatched total power detection schemes to GCV 
detection will show GCV the favored approach to even lower SNR signals. 

This can be summarized by saying that the generalized coherence technique 
achieves its goal of using the signal polarization type information in a useful 
way, and of discriminating against unpolarized and incoherent signals, without 
significantly sacrificing sensitivity, and in fact provides a significant 
enhancement to sensitivity. 


- 64 - 



GCV (Normalized) Hean Power (Normalized) 



Figure 3,4 - Comparison of Time Integration of Total Power 
with Generalized Coherence, with a Signal to 
Noise Ratio of 1.0 (in a Single Matched 
Polarization), calculated for 25 Successive 
Data Points. 
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Figure 3.5 - Comparison of Detection Methods 



3.4 - Analysis of Variance (ANOVA) Algorithm 
Background 

A shortcoming of most detection algorithms is the degree to which they 
have been designed to detect specific types of signals. While this philosophy 
serves to maximize sensitivity to these particular signals, many are blind t*' 
even strong signals outside this range. One algorithm that demands a minimum 
of prior assumptions is an analysis of variance (ANOVA). On the time- frequency 
matrix of the NCSA's output, ANOVA can perform three tests of the null hypothesis 
(that noise alone is present) to search for significant differences between 
means of rows, columns, and interaction terms. If the null hypothesis is not 
once rejected, ANOVA concludes that no signal is present. But if rejected 
even once, not only has ANOVA addressed the primary question of signal detection, 
but valuable information about the type of signal present is provi ded— wi th only 
a simple thresholding of the three results of a single standardized calculation. 

For exan^)le, if a monochromatic signal is present, it will 
manifest itself in the ANOVA matrix by increasing the power ir. the colimm 
that corresponds to its frequency. The presence of this type of signal 
would be detected in the ANOVA by a rejection of the null hypothesis of 
colimin effects. Reflecting the difference between the coluimi means 
associated with the signal and those colunrn means associated only with 
noise, the value for the sum of squares of colwim sums would increase, 
resulting in an increased variance ratio for co.wnns. If the variance 
ratio for columns exceeds a present critical variance ratio, the null 
hypothesis is rejected. 
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similarly, a broad band signal would Incroase ttie power In alt ' 
the matrix columns within the sill's bandwidth. This would result In 
hl^r column means for tlwse o)lumns containing the signal, prodi^li^ 
a proportionately more significant variant ratio ft»r tl» (X)lumns. 

1^ means and t1^ associated ram variance ratios are affected by 
pulsed broa<tt)and signals which distribute fxmer acrass many frequencies 
within one second. As with tt» colt«n variance ratio, the effect on tte 
rout variance r^tlo of more than one pulse within the matrix Is ciMulatlve, 
the optlmm al^irlttM response occurring id»n the sl^l fills (Wie row 
of the cells of the ANOVA matrix. 

f^pldly drifting signals are detected prlmari?. y the Interaction 
term of ANOVA, though significant row and colunm effects can result as 
well, (Spending upon the slope of the path of signal as It moves through 
the matrix. Drift rates greater than one Hz/sec being detected in rows, 
while rates that are less than one Hz/sec are <tetected by the colimm 
hypothesis. 

Because ANOVA makes no assu^tions atout the signal to be received 
and demands only standardized calculations. It serv^ as an InqKrtant 
global signal detector for the SETI problem. Though further analysis is 
required after detection, the results of the three tests of null 
hypothesis (Row, Column, and Interaction) greatly reduce the problem of 
signal type Identification. 
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The concepts of ANOVA celculetlom Mill be llltistrated here by Moni 
equatitms arnl exaa^les, and the teta11«1 natiMMitIcal tfiMry Is explain^ 
In ^|MfKl1xA3.8. We Mill assunM here that tite tlM-freqiaincy array ou^t 
from the NCSA has a zero mean (I.e., that ^e noise gott equally al>ove Mid 


beloM zero, as will be shoMn Is provided by baseline removal, a stMKlard 

procedure of radio » trono^y.) 

Columi ^Itmn 

1 0 

Rom 1 

t 


f 


Ron I 


a small {HirtlMi of ^ NCSA 
output array* ^r idilcih p^k 
ca1cu1at1(M)s are to be 
perfonaed 


Time 

i 


Frc?;ueri(y • 


The three ANOVA calculations are: 

( averaae of all \ 2.f average of all \ 2 . ^average of all \2 

points in Ron 1/ I pointe In Ron 2/ *** A points in Ron 01 

ROW EFFEa 

I • (average of the square of all fwlnts in the array) 


COLUMN _ 
EFFECT " 


( average of all \ 2 . / average of all \2 . J average of all \2 
points In Column 1/ \ points In Col. 2 / **'* \po1nts In Col.if 


0 * (average of tiie square of all the points In tt« array) 


INTERACTION 

EFFECT 


E average of all average of all x 

all 1 ,i ~ twiints In Ron 1 " points In Col.j * 

I • 0 * (average of the ;^quafe of all j^qts Ip ^ array) 


All ^r^ cp leu latipns Involve /Uie averages of averages, and are pomer ratios. 


All are normalized to the average powir^-. ' 


^e eiulm^lii of the Uiree calcula- 


tions to var1(His types of data structure. For simplicity, data values are 
limited to ±1 and 0. Note that the number of -*-1 values equals the number of -1 
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valuts, so that the zero-mean asstm^tlon Is valid. 

Exawtit It 111 

0 0 0 
-1 -1 -1 

Rem Effect « U)* . i 

Co^mm Effect » i^ . L — = 0 

interaction Effect » 

( 0 - 0 - 0 )* + (- 1 + 1 - 0 )* + (- 1 + 1 - 0 )* + (- 1 + 1 - 0 )* _ « 

^ 3 • 3 • ( 575 ) 

This example Illustrates hom the Row Effect detects long horizcmtal data 
structures , 


Examle 2: 


1 0 -1 

1 0 -1 

1 0 -1 


Rom Effect = 
Column Effect » 

Interactiwj Effect • 




“ 0 
» 1 


(l-O-D* + ^ ^ (0-0-0)% 

(- 1 - 0 + 1 ^* +^( 1 - 0 -^* + ( 0 - 0 - 0 )* +(- 1 - 0 + 1 )! . 0 


This examole Illustrates how the Coluim Effect detects It^g vertical data structures. 
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Exwpit 3: 10-1 

0 1 0 . . 
- 1-1 1 

Ho»# Effect • I 

Col«nn Effect = ' — =0 

Interictlon Effect » .iH-or ^ (0-V3-0)^ ^ . 





This example illustrates how tiie Interaction Effect detects long diagonal 
data structures. Itote that alttioi^ all the sanples use square arrays, this 
is of course not wcessary for ^OVA calculatiors. These examples have also 
iBed infinitely high SNR*s--to clearly illustrate the calculaticm. Evaluation 
of the technique was, of course, c(mducted on low SNR signals, and the tetection 
sensitivity was thus explored as shown in Appendix A3. 9. 
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Ptrf(i^'‘Wf>ce 


The potential of Analysis of Variance as a signal <tetector has 

t^n evaluated by means of computer slimilatlors In a series of 
systematic experiments. The computer simulation Involm modeling the 

noise field as a (10 x 40 x 2) matrix of expcmentlally distributed 
random variables. 40 represents tl» number of fregu^cy ^ni^ls 
(columns) » 10 represents the niMber of Indepenctent time saig»1es (roMs), 
luid 2 represents the manber of observations (data valtm^s) per ^11. This size 
matrix Is that imed In t^e NBIT algorithm for reasons to bd described later In 
this chapter. At least 2 valiMS per cell are necessary for the calculation 
of the Interaction term In ANOVA. This term Is Important for the detection 
of certain signal types ami for the reduction In the size of the residual 
variance. Since the residual variance Is the denominator of the variance 
ratio which Is used for tests of hypothesis, a reduction in It Increases 
the overall sensitivity of (tetection. Although 1<teally, replication siwuld 
be obtained from the slRxiltaneous observation of the event of Interest, 
the sinnilatlons are carried out In the more realistic fashion of letting 
the first row consist of data taken during the first and second seconds 
of observation and each subsequent pair of scans fill the remaining rows. 

Thus a total of 20 1n<tependent time san^les are used. Superln^sed upon 
tte noise field Is a signal of given specification. 

The data, obtained In this fashion, are tten analyzed by ANOVA 
techn1qi;m. The end product of the ANOVA algorithm Is the calculation of 
the variance ratio for rows, coluims, and Interactions. When the variance 
ratio exceeds a preset critical value or throshold, the null hypothesis Is 
rejected and the signal Is detected. By examining the values of the variance 
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ratio tNMter systmatically varylt^ conditlem* in ytKlart^idliif of M0fA*s 
tffftctlvenoss a dttfctM* unfolds* 

The slmlatlon tt^NniMints and dotal led results are described In 
^emilx A3*9. 

Cwclusicws 

The simulation studies Indicate the p(»s1b1e strengths and ne^n^s^ of 
ANOVA as a signal <tetector. For the matrix dli^nslons pr^ently being oMsIteted, 
It seei^ 1(tea11y suited to tin tetectlon of a bi^dbaml pulse. NaiToid>n^ non> 
drifting and slonly drifting signals am also h«nd1ed well. The om real 
failure occurs with the rapidly drifting sl^at (sl(H>e •!}. It Is notable that 
changing the matrix dimensions to a square would certainly Improve ANOVA's 
capability to detect this type of signal, while not degrading the performance of 
detecting the previously mentlcHied t^es. 

Based on the simulation results Involving bit quantization, a reasmtable 
bit size for the data processlr^ using M10VA has been determined to be N * 8, 
as 1$ used In the Implementation. Thus, In all, AMNA presents Itself as a 
significant global signal detector for the SETI problem. No assu^tlrms r^d 
be macie about the type of signal. Once cme Is detected, further analysis Is In 
order. By retaining Information m whidi of the three hypotheses (Row, Colunwis, 
aiKl Interactlore) are significant, tee prcdtlm of 1<tent1f1cat1(») of signal type 
Is significantly reduced. 

3.5 « Oasis System Algorithms 
(htervlew 

The (tesis Signal Detector Is a l^al-TIme Proc^sli^ Systen contalnlr^ four 
basic components: tte Pulse Detector (PD), the Carrier Have Detector (CHD) 
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Mhich 1$ meant to detect narrcMband carrier signals Hiat be drifting slokriy 
In frequency, the l^hmerlcal Battery of IndepeiHlent Tests (MBIT), idildi comiste 
of a co11ect1(Hi of statistical tests followed by a patUm recoi^ltlon algo- 
rithm, the Cluster Detector (CD), that Is designed to <tetect nwr-r«i<tom 
associations among the signal parameters In N-space, This system Is shown 
conceptually In Figure 3.6, 

Data Stream Preparation 

Me are assuming In titls study Uiat two of tte functicms {Mrftnrmed by the 
NCSA are; 

- Quantization level preparation 

- Baseline removal 

Data within the NCSA Is clipped at times both by truncating low order and 
high order bits. This process has been given much consideration by the 
designers of the NCSA via simulations, and, that the quantization levels be 
renormalized linearly, is a requirement of the NCSA Implwnentatlon. Based on 
the simulation studies It Is felt that each quantization level corresponds to 
.3a of the signal power. Implenwntatlon design for this Is given In Chapter 4. 

Baseline romival Is accomplished within the NCSA throigh the use of Its 
low resolution 57 kHz output channel. A region of the sky slightly offset from 
the target to record the broadband spectral responses of the receiver, R(f); and 
this function is then used to normalize true spectra of the target T(f), Nor- 
malization is accomplished In the following manner. The coarse resolution values 
are used to compute 

R.(f) . 

yielding a zero mean normalized (except for local power excesses) spectrum, R'(f). 
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Receiver 



Figure 3.6 - OASIS System Signal Detectors 



This practice Is common and essential In Radio Astronony to remove receiver 
gain effects before observation of their source. 

Pulse Detector (PD) 

The coarse binning stage of the system examines each spectral scan under 
a wide range of resolutions, to search for excess total power that might 
signify a signal of Interest. Although It can be argued that very narrowband 
signals are optimal for Interstellar communication, to neglect the possibility 
of detecting broadband signals or pulses within the data, when such Information 
Is easily accessible, would be a mistake. Such a wideband signal may occur 
In the eavesdropping or the pulse-beacon cases of signal transmission. 

Under the most Ideal condition, when the bln width corresponds to a 
transmitted signal's bandwidth, coarse binning serves as a matched signal for 
signal detection. However, since the signal bandwidth and central frequency 
are not known, a wide range of bandwidths Is used. The algorithm first sums 
the 8 X 10* adjacent frequency bins pairwise In non-overlapping pairs to pro- 
duce 4 X 10* channels times 2 Hz/ channel resolutions which are compared to a 
threshold. This procedure Is repeated again and again until every set of 
2 (0 < k 23) adjacent bins has been combined and tested. Any coarse bln 

which has a total power above the predetermined threshold causes an alarm to 
be sent to the central computer. It should be noted that one scan (one second 
of NCSA output) can be analyzed at all 23 resolutions within one second. 
Simultaneously, an Integrated scan, the sun of all previous 8 l^z x 1 Hz 
resolution scans Is similarly binned and tested for excesses due to repeatedly 
detected pulses. 
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Carrier Wave Detector (CWD) 

Overvi ew - The Carrier Wave Detector (QfID) Is designed to search for 
narrow band signals which may be drifting slowly In frequency due to the 
Doppler shifts. The CWD works on the 2-d1mens1onal array, having dimensions 
of frequency and time. In which a drifting carrier would appear as a straight 
line whose angle of Inclination with respect to the vertical Is less than 45®, 
I.e., a drift rate -1 Hz/sec < r <_ 1 Hz/sec. 

In principle, the CWD evaluates the data by overlaying the array with 
a "mask” which consists of 20 rays, each of which has a different slope. The 
rays all Intersect at a single time-frequency element of the array and form a 
(touble fan (Figure 3.7). A column of 20 consecutively observed spectral 
elements (20 seconds of data arranged one below the other) Is passed under 
the mask so that each of the 8 million single Hertz channels acts as the center 
point of the mask exactly once. For each ray of the mask, the CWD weights each 
k^ row entry (Figure 3,8) of the Incoming data for the way In whicii the ray 
Intercepts the array, and then a Generalized Coherence Value (6CV) Is conv>uted 
which reflects the Intensity and coherence of the weighted data overlayed by 
the ray. The results from each new 20 seconds of data analysis are tested 
against a threshold. If any 6CV exceeds the threshold, the detector sends an 
alarm to the central processor. In addition, all GCV's are accumulated for the 
entire 1000 seconds of observation time. At the end of the total observation 
time, the accumulated GCV's are also compared to a threshold. These accumulated 
values are also evaluated with the coarse binning technique, to search for 
extended drifting features. 

In order to take advantage of the fact that polarization and phase 
Information are provided by the pair of NCSA's, the QilO operates on the 
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Figure 3. 7 - Two DCD Time- Frequency Masks. Both are Centered at Time k, and 
they are at Frequencies j and j-1. 



Generalized Coherence Values (GCV's) described i^ove. For each angle «id 
each center frequency* one GCV Is computed using 20 values* each of which 
Is a weighted entry for that ra(y (Figure 3,8), The weighting process 
Involves both amplitude and phase* and Is disclosed In detail In Appendix A3. 10. 

Numerical Battery of Inctependent Tests (NBIT) 

Introduction - Since there are numerous possibilities for the forms of 
extra-terrestrial Intelligent signals* the signal detection problem can be 
approached In two ways: (a) a set of tests each of which has a very high 
probability of detecting signals of a very specific type; or (b) a set of 
tests that have a lesser probability of recognizing a much more general 
class of signals. 

The Pulse Detector* which searches for pulses of various widths* and 
the Carrier Wave Detector, which searches for narrowband signals at various 
drift rates, are examples of the first approach. However, because the 
characteristics of an ETI signal are purely conjectural, to search even a 
significant portion of the possibilities within the general class would require 
a prc^ibitive amount of memory and/or processing. 

The second approach is exemplified by the numerical battery of 
independent tests (^®IT). The r©IT signal detection system is 
a tv«) stage technique which reduces the data to a set of 9-bit values in 
the first stage, and employs a pattern recognition algorithm on the 
compacted array in the second stage to detect signals. 



Frequency 


Figure 3. 


GCV 


- Weighted Contribution to GCV of Each Time- Frequency Measurwient 
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In the first stage the entire array is siM>divi<ted into blocks, each 
of which is 40 Hz by 20 seconds in frequency-tine space. Each block has 
an 8 Hz overlap with adjacent blocks in order to aneliorate the aliasing 
probl^ between blocks. Nine statistical hypothesis tests are applied to 
each block of incoming data. One bit of the 9-bit nunter is assigned for 
each test, and the bit is set when the null hypoti^sis is rejected. Tte 
first sta^ tests include: total power in the two polarizations; degree 
of polarization; complex amplitude coherence detection; broa<N>and 
coherent pulse detection for each polarization using generalized 
coherent values; 8-hertz pulse detection for each polarization; AHOVA 
analysis of variance for rows (frequency), WOVA 0 )lunns (time) and 
ANOVA interaction, for each polarization; and ^<lness of fit for each 
fK)larization. 

For each of the tests, the null hypothesis is rejected at one of the 
two thresholds (ot-levels). The first level (low-level) threshold is set to 
trigger relatively easily (a = .5, which means that 50% of the time tte bit 
for a particular test under conditions of noise alone will be "on"). The 
second level (high-level) threshold, on the other hand, corresponds to a very 
low a- level in order to ensure a very low false alarm rate for immediate 
central con^juter processing (the threshold vali^s are adjusted to reflect ttie 
results of simulation or actual qreration). The first level threshold is set 
so that the difference between the false alarm rate under noise (a) and the 
probability of detection (1-8) for the signal type for which the test is 
(tesigned is maximized for each algorithm. This criterion is developed further 
In Qiapter 5. 


C-d 
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The 9-bit number for each block In the first stage is stored in a 
"pattern block" whidi is passed on to a tMo-step cluster detection pro(^sor. 
The first step of the processor tabulates a histogram (m tee occurrence of 
various 9-bit values and sets an alarm for tee central con^uter if a statis- 
tically high number of specific "pattern points" (Kcurs. The second step, 
a single linkage cluster seeking algorithm, seard^s the entire pattern block 
for sd>tle signal patterns and sets an alarm if a pattern emerges. 

The tMO-step cluster detection algorithm whidi analyzes the array is a 
n(m-deterministic signal detection scheme, and as such has a great deal of 
flexibility. 

In general the entire fBIT system has several advantages owr more 
traditional sicnal detection systens. The teo-stage nature of the system 
alloMS a "second look" at the data in real time and it employs two independent 
and distinct methods. The meaningful cemvaction of the data in the first 
stage makes it possible to stwe information for the entire search duration at 
a reasonable cost, which is vitally necessary to detect a weak signal of pro- 
longed duration. The NBIT signal detection system is also versatile because 
it is not signal- type specific. 

Figure 3.9 provides an overview of the NBIT and Cluster Detection 
Algorithms. 

For tee sake of conceptual clarity. Figure 3.9 intentiwally omits the 
following details; 

1. A BLOCK overlaps its neighbors by 4 points on each side, so its 
width is actually 40 points, and total content is ®50 points. All 
other quantitative numbers are correct as shown. 
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Figure 3.9 

An Overview of the MBIT and Cluster Detection Algorlthee 


Begin with the MCSA 
output data arr^ of 
8 billion points; each 
point Is four 4-b1t 
numbers. 


S(d>d1v1de It Into 
1?.S million BLOCKS 
Of points each; 
each point Is still 
four 4-b1t numbers. 


f^rform 9 separate tests on 
each BLOCK. Each test 
produces only a pass-fall 
decision— 1 .e. , l-b1t output 
for each test. The ovals 
represent the test 
operations. 


8 million frequency points- 


the NCSA output array 
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1000 time 
points 
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250,0(X) frequeru^ points 


I I 

JLl 


CCMPACTEO 


ARRAY 


I 


50 time points 


Divide the COMPACTED ARRAY 
Into 1250 PAHERN BLOCKS of 
10,000 PATTERN POINTS each. 
Each point is still a 9-b1t 
m«irt)er. The division allows 
for conputer processing of 
each pattern block. 




4 ^ 


200 frequency po1nts^^ 


a PAHERN BLOCK 


io,oop 

k: 


threshold 




50 time points 




HISTOGRAM 


For each PAHERN BLOCK, 
make a HISTOGRAM of the 
number of PATTERN POINTS 
in which each possible 
combination of the pass- 
fall tests occurred. 

There are 2’ possible 
combinations of tests. 

Each combination is 
called a PATTERN 
CLASS. 

(all 


U O. 


li 


s 

Q. 
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PAlfWN CL! 


OOOODOOOO 011001011 

tests failed) (some passed and 
some failed) 



111111111 

(all tests passed) 


For any PATTERN CLASS that 
exceeds the significance thres- 
hold on the HISTOGRAfl, make a 
CLEANED PATTERN BLOCK, that 
maps only the occurrences of 
the corresponding calcinations 
of tests. 



50 time points 


a CLEANED PATTERN BLOCK 


I 


Search the CLEANED PATTERN BLOCK 
for CLUSTERING, using an adjacency 
algoritimi. Remove all PATTERN 
POINTS that are not part of a 
CLUSTER. If the number of 
remaining PATTERN POINTS is 
significant, decide that a 
signal has been found. 


*200 frequency point 


a CLUSTER 


t 

50 time points 


i 


(tote: Some features have been simplified for clarity on this diagram, 

for actual details. 


See the text 




2. There are additional signal detection decisions ma<te at tiro Inter- 
mediate levels: 

a. The 9 tests each have two thresholds. If ti:e h1^-1eve1 
tereshold Is exceeded, a detection Is lan^lately Indicated. 

The low-level threshold 1$ set such that It Is exceeded by 
noise MX of ^e time, arwl Is used to create tee 9-b1t 
PATTERN POINT. 

b. If any HISTOG^ has an extremely high point, a detection Is 
Immediately Indicated, regardless of tee presence or absence 
of clustering. 

3. The 9 tests are actually histogramed with regard to on bits “1" 

and don't care bits not the on and off "0" bits. This allows 

for additional useful test combinations like total power and 
degree of polarlzatlcm and Ignores everything else. Thus there 
are actually 2® » 512 PATTERN CLASSES. 

4. HISIOCaiAM points are not actually chosen for cluster searching on the 
basis of their absolute level. Rather, the available data processing 
power and time are optimized by chcrosing the highest HISTOGRAM bln excesses 
globally over all PATTERN BLOCKS. 

5. The cluster search algorithm detects not only "blobs" as Illustrated, 
but also finds and follows long linear features. 
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•me Individual NBIT Tests 

1, Total Power - "me purpose of this test is to determine whether the total 
power accumulated over a 20-second by 40-Hz block is significantly different 

from that expected from white noise alone. As the input to the NCSA 
is Fourier Transformed, the power of any pulse or wide-band signal is 
spread across the frequency domain. Summing the power over a range of 
frequencies and time may accumulate the power to a detectable level. 

800 

Total Power = K = I (|x.. |* + |yj*) 

' i=l 1 1 

This value is then compared with previously determined first and 
second level threshold values. 

If this power exceeds either of the two pre-determined thresholds, a 
bit is set and sent to the CPU for further processing. Because the white 
noise is thought to be fairly constant, and this test sets a bit approxi- 
mately fifty percent of the time, the first power threshold will only be 
slightly above the noise level (lo). The second threshold, if exceeded, 
immediately notifies the operator, and all information from that scan is 
recorded. This threshold is adjusted as discussed in Chapter 5. 

The type of signal that is most easily detected by this method is a 
wideband signal. The rationale for this argument is that the algorithm is 
an integration technique and since the threshold is rather low, any amount of 
signal in the 40 Hz block will cause the 1-bit to be set. Since most wide- 
band signals are translated into the frequency domain as a set of sine 
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functions, a relatively larc^ number of points In the block will 
contain the signal. When these points are sunned together the power 
In the block will be above the threshold level. Narrowband signals on 
the other hand, appear as approximate delta functions, and as such, 
the signal will be In relatively few points In the block. If the 
power of the signal Is low enough, a narrowband signal may go un- 
detected by this test. However, this algorithm Is not specifically 
sensitive to any signal type, but rather the elevation In power, over 
the noise level, that comes with having a signal present In the block. 

2. Degree of Polarization - A signal emanating from any single antenna 
win be completely polarized. Even when degraded by noise, a sufficiently 
strong signal will have a resulting degree of polarization significantly 
greater than that of noise alone. 

In this test degree-of-polarizatlon Is calculated along each row 
of the 20 sec x 40 Hz block, and con^ared to a threshold value. Oegree- 
of-polarlzatlon Is calculated along rows In order to maximize sensitivity 
to pulses and other signal types that lack the time coherence required 
for detection by the CWD. 

The results of this test can also be applied to the signal recognition 
problem. Signals of high power but low degree of polarization are unlikely 
to ETI signals, while a high degree of polarization Is a hallmark of 
artificial signals. 

Procedure - A general discussion of degree of polarization can be 


found In Section 3,3. For the purposes of this test, the following 


calculation is performed for each row of the current block: 


where 


m 


w 


V /cnfF -i7n"+ 4 
W + ITT 

1 40 1 

• W "iN** ^ “ TO ^1^1* 


— 1 

and xy* = tq I x.y *. 

4U 1 1 

stored thresholds. 


Each calculated value of m Is compared to 


Sensitivity - If an overall false alanr rate of is desired for 
each block, then per row the false alarm rate is o * (Then 

if Oq * ‘S, a * .034.) 

The behavior of the degree of polarization under noise was determined 
empirically. For an integration length of 40, noise has mean degree of 
polarization .176 and standard deviation .078 (based on a sample size of 
200). This is apparently independent of noise power. 

If Oq = .5 is chosen, then the threshold value for bit setting is .32, 
either by determination of the 93rd percentile of the data or by assumption 
of normality. 

Signals of various strengths and durations were simulated, and their 
degrees of polarization calculated. Figure 3. 10 summarizes the results, 
with each point representing an average of 50 values. (Note: signal 
width is the mmiber of bins in a given row actually containing the 

signal, even if they are not contiguous.) 

Examination of Figure 3. 10 indicates that if degree of polarization 

m results from width * A and SNR = B, then for any d the same value of ^ 
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Degree of Polarization 
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Figure 3. 10- Sensitivity of the Degree of Polarization Test 
to Signals of Varying SNR, for Various Signal 
Time Durations 


lumber of Roms Contilning a Signal of Bandwidth 40 Hz 


SNR 

1 
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6 
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9 

10 
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.12 
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.64 
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.56. 
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.92 
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.50 
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.98 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

.75 
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.99 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1 

.98 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

>1 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.001 

1.00 

1.00 


Table 3.2 - Probability of Signal Detection by 'the Degree 
of Polarization Test, as a Function of Signal 
Time Duration (number of roMs) and SNR 
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will result from width • dA end SNR ■ B/d. In other words, degree of 
polarization Is proportional to s^y^fflw T dt ' h ' 

Table 3.2 Indicates detection probabilities for a signal of width 
40 occurring In r rows of the block. This table can be applied to signals 
of width less than 40 by using the appropriately factored SNR. (Note; 

There Is close agreement between the probabilities (H)ta1ned from the 
samples and those derived from normality assumptions.) 


3. Complex Amplitude Coherence - "nils test Is designed to detect any 
polarized signal which may be continuous, pulsed, sporadic, or amorphous. 
Tie mnplltudes need not be particularly large, as we are looking for 
persistent coherent signals. 

For each 40 Hz by 20 sec. block of data, we compute 


|A| 




where xjT*' refers to the average cross power In the two NCSA's, x and y 
refer to the power In the x- and y-MCSA respectively. The magnitude of 
|A| Is an Indication of the correlation (or coherence) between the two 
polarizations; for example. If the two outputs are linearly related, 

|A| ■ 1, while if there Is no correlation whatever between the two, 

|A| ■ 0. At the end of processing for each block |Ai Is compared with the 
predetermined first and second level thresholds. 
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4. 6«nera11te<l Cohcranct by Row (A Bro^and Cohtf nt PuIm Datactor) 
Wideband pulses will appear In a string of contiguois frequMcy bins simulta- 
neously, for only one time period. Ihey mey ttien ref^at at some later time. 
Such a signal could be generated at traismltter by a sine time vaiiatlwi. 
Ihls wcMild cai»e Itie received signal phne to have a linearly Increasing 
variation with frequency* at a slc^ Inversely proportional to the bandwidth 
of the pulse. 

As shown In Appendix a 2.1, the MCSA does not Introduce any nK>re than a 
linear relative phase shift between adjacent frequent bins. Therefore, ^e 
original linear phase variation with frequency is maintained at the out- 
put. The generalized coherence value may be computed across frequency rows, 
since It Is Insensitive to linear phase varlatlcms. 

Specifically, the 6CV Is calculated across all 40 frequency bins, 
separately for each row. If the 6CV for any row exceeds the predetermined 
threshold(s) , the appropriate b1t(s) for this test will be set. 


5. 8-Hertz Pulse Detector - This test provides a means for (tetecting 
Infrequently occurring pulses whose widths are a few Hertz. It computes the 
averse power P In an 8-Hz span across each frequency row for each polaflzatldn. 


“i 


1«0 




I ^ 1^1 

8 1-0 ' 


where x^ wd y^ are row outputs for the x-and y-MCSA's respectively. If 
«fty of these results exceeds the first le\^1 threshold, the l-b1t Is set. 
Likewise, If the results surpass the second level threshold, such an alarm 
Is set as well. 
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6 >7.8. Analysis of Variance (ANOVA) - ANOVA can be used to detect several 
different types of signals. For exw^le, assume that a narrowband monochro- 
matic signal Is present. This signal will manifest Itself In the AHOVA matrix 
by Increasing the power In the column that corresponds to Its frequency. The 
presence of this type of signal wrild be detected In the ANOVA by a rejection 
of the null hypothesis of column effects. That Is, the column mean associated 
with the signal would be higher than the column means associated with 
the noise. This difference In mean values is reflected accordingly In 
the colimn sum-of-squares , which In turn results In an Increased F-ratIo 
for colwnns. If the F-ratIo for columns ex^eds a preset critical F-value, 
the null hypothKis Is rejected. Similarly, a broacband signal would 
Increase the power In all the matrix cells associated with Its frequency 
bandwidth This again would result In an Increase of column totals for 
those columns containing the signal, and ANOVA would pick this iq> by 
producing a significant F-ratIo for the cx)1uinis. In the event that a 
broadband continuous signal conv>lete1y covers the number of columns In 
the ANOVA, this would be detected by comparing the total power (which Is 
calculated as one of the steps In MIOVA) with the total power In the 
previously established noise matrix. Thus, narrowband and broadband 
continuous signals would be detected by a rejection of the hypothesis 
associated with columns. 

A pulsed signal which Is broadband and periodic would result In 
higher power reading In the row totals (for those rows where It a(N>eared) 
arwl would result In a rejection of the hypothesis associated with row 
effects. It 1$ apparent that the signal to noise ratio Is a critical 
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parameter In apy detection scheme. Slnwlatlon studies of SWt and MM)¥A 
sensitivity are discussed ^ove In Section 3.4. 

A drifting signal (or a broadbind signal that has higher freqt^cles 
arriving at regularly spaced Intervals of time) would be detected primarily 
by the Interaction term of /U10VA. Significant row and columfi effect c«n 
result as well, depending upon the slope of the path of the signal as it 
tm)ves through the matrix. Slopes greater than minis 1 are detected by 
the row hypothesis. A tall spiked signal that appears son«vhere In the 
matrix would be detected by row and column hypothesis and possibly even 
the Interaction. 

The M10VA tests are conducted for each polarization using the power 
contained in each bln of the 20 second by 40 Hz block. The F-ratlos for 
each of the three tests are compared with the thresholds and the appropriate 
action taken. 

9, Goodness of Fit -This test compares the actual probatf)111ty distri- 
bution of the NCSA output, treated as a quantized variable, with Its expected 
distribution In the presence of Gaussian white noise. It Is a particularly 
useful test since It makes no assumptions about the nature of the signal to 
be detected. 

For this test the complex output of the NCSA is treated as a pair of 
real nun^ers for each 1 Hz frequency bln every secorxl. Each block contains 
800 pairs, so, for the two polarizations, there are a total of 1600 pairs. 

The set of output values for each member of the pair is compared with a 
Gaussian distribution, representing the expected outcon« for white noise 
alone: For any possible output value k, let Tj^ be the theoretically 
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expected niMber of occurren^s of k end let be ttie observed nuiber of 


occurreiitts . 
fhen T 


£ 

k 


- \)* 
— — 


Is nearly a x* (dil-squared) 


random variible with H - 1 degrees Of freedom, where M is the niaber of 
possible output values for k. This value of T is compared against stored 
thresholds. 


Pattern ftecognition - The St^ II WBIT Alqcnrithm 

1. Introduction - A pattern Is sis^ly the description of an object. 

Pattern recognition is c(msidered one of the mtst coonon of activities of 
man, as well as all other living organisms. For extraterrestrial life to 
survive, it too must be able to proves and recognize patterns in ordmr to 
communicate ^out and react to the dangers inherent in its envlnwiment. 

1i fe — be it terrestrial or extraterrestrial •-develops communication cap^ilities, 
the signals used f&r communication must contain patterns distinguishable from 
randomness. At the interstellar coRiminl cation level, ei^rgy radiated acrt»s 
interstellar space must have, if nothing else, distinctive patterns that can 
be recognized as an artifact against the background of natural radiatiwi. It 
is a primary function of the Oasis Signal Detector to recognize these patterns. 

In ^e past, every SET! experiment began with the assunption that the 
hypothetical extraterr^ trial radio signal would take some particular form. 

The appropriate signal analysis and (tetection scheme was implemented, 
the ex(»rin^t was run, arul the negative findii^ could be expressed in 
a sii^le sentence; "No signal of the particular form wd exceeding x 
flux derrsity was found." 
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These restricted searches had the advantage of t^ing relatively 
e^y to mount t but their narrowly defined negati\^ results were never 
very satisfying. Designing the signal detector as a matfiewatical pattern 
recognition system broadens the constraints on the search immeasurably. 

In particular, a mathematical pattern recognition system can take the 
parallel outputs of a number of analytic procedures applied to incoming 
signals and make a single decision based on the location of the point 
in feature space which they represent. 

2. General Cw»i derat ions - Since the NCSA provides a multidimensional 
output (frequency, amplitude, polarization, [^^e, and time), a generalized 
pattern recognition algoritiwi has been deemed a more useful tool than any 
pattern-specific signal seardi method, such as a matched filter. Furthermore, 
given the fact that planetary rotation would cause a transmitted pattern to 
drift across frequencies at an unpredictable rate, the need for a general 

spatial search for patterns is further accentuated. The pattern clearly visible 
in Figure 2. 3a can '’Ot be recognized in any frequency scan. In the figure 
imagine frequency increasing to the right and time downward. If we added 
signal power vertically in time, the signal would again be lost in noise. The 
<mly cue as to its artificiality is its spatial structure in frequency-time 
space. 

Although the human brain with eyes and ears as its input receivers, is 
the most powerful and versatile pattern recognition system we know of and could 
easily recognize a pattern such as the onesshown in Figure 2.3 , it is also 

std>ject to fatigue and hallucinations, it is selective and imprecise, and for 


- 96 - 


our purposes, too costly. The pattern or spatial non-homogeneity shown 
in Figure 3.11 (a) could be interpreted in many ways by the mind, or not be 
recognized as a "pattern" but instead be classified as a random phenomenon 
such as shown in Figure 3.11 (b). 



(a) (b) 

Figure 3.11 


Furthermore, the enormous data rate provided by the MCSA eliminates the 
human operator from consideration as a preliminary realtime pattern recognizer. 

If the raw data were to be displayed in real-time and in standard TV display 
densities, the latest 1000 seconds of accunulated spectra would require a 
console 43 miles long! (See section 4.1) 

Powerful algorithms exist for recognizing well-defined prototype signal 
categories and for clustering (grouping) unknown categories sharing similar 
properties (Andenburg, 1973, Fu, 1976, and Ton and Gonzales, 1974). Some of 
these algorithms have been modified to suit different objectives in a detection 
and recognition scheme. In view of the implementation limits set by research 
time available in this study, we developed the simple yet extremely general 
algorithm described in this section. We feel that more sophisticated algorithms 
could easily be applied to the problem, and suggest study of their implementation. 
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3. Definitions - For sake of clarity and preciseness In the ensuing 
discussion we need to make the following definitions: 

a) A scan refers to one spectrum - 8 Wz x 1 Hz. with 1.6 bits 
per channel. 

5 ) The entire array refers to 1000 seconds of the MCSA output and 
is a 1000 sec. x 8 MHz matrix of raw data. 

c) A block is a 20 sec. x 40 Hz section of the entire array. 

d) The compacted array is a 50 x 250000 matrix of 9-bit numbers. 

e) A pattern block is a 50 x 200 matrix of 9-b1t numbers whidi is 
a vertical strip of the compacted array. 

fj A pattern class is a set of 9-bit numbers with specified bits 
equal to 1 which ctetermlne the class and no constraint on the 
unspecified bits. Thus, both (0, 1, 1, 1, 0, 1, 0, 1, 1) and 
(1, 1, 0, 1, 1, 1, 0, 1, 0) belong to the class (*, 1, *, 1, 

*. *. ** 1* *)i and, also, to the class (*, 1, *, *, *, *, *, *, *) , 
while (0, 0, 1, 1, 0, 1, 0, 1, 1) belongs to neither of these 
classes. 

gj A pattern point Is any 9-b1t number of the compacted array. 

h) A (»11 Is the time-frequency location of a pattern point. Note 
the one-to-one correspondence between blocks and cells. 

Figure 3.12 should further clarify the above definitions. 

4. The Rationale for Cluster Analysis - Cluster analysis is a pattern 
recognition technique which encompasses many diverse methods for discovering 
structure within complex bodies of data. The common characteristic of the 
algorithms of cluster analysis is the attempted grouping of data units into 
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Figure 3.12 - Definitions of the Terms used for Cluster Analysis 
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clusters such that the elements within a cluster have a high degree of 
“natural" association. The two ways we define this natural association In 
our application Is continuity In time and/or frequency of similar pattern 
pol nts. 

We view a pattern point as a point In the compacted data arrey which 
has associated with It the time and frequency coordinates of the cell It 
corresponds to. Each bit of a pattern point Is set by one of the tests 
described In stage I and has value either zero or one. If a signal Is 
present which sets some particular bits for some length of time and/or 
across some frequency width, then the pattern points which have these 
bits set will lie along meandering lines or be groufwd In other types of 
clusters In the compacted frequency- time domain. Hence, we need algorithms 
which will discover various spatial relationships between similar pattern 
points. In order to obtain this objective, we use a two step cluster 
algorithm which consists of a “smart" histogram and a trodlfled single 
linkage cluster algorithm. 

5. Overview - The method by which a pattern block is examined for the 
presence of a cluster Is straightforward and will be described briefly In this 
section, with more detail being given In Appendix A3. 11. 

We recall that a cluster is defined as a close spatial association of a 
nimiber of pattern points belonging to the same pattern class. As an example 
of a cluster consider Figure 3.13 which shows a portion of one pattern block 
Indicating all pattern points having bits 1 through 5 ; i.e. , members of 
class (1, 1, 1, l,il, *, *, *. *). Since there Is a probability of 1/2 that 
any of these bits will be on due to the fluctuation of the noise background, 
t^ie probability of them all being on In the sane pattern point Is l/32due 
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Figure 3.13 - A Cleaned Pattern Block 
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to the background alone. The algorithm searches for deviations from the 
statistical noise field by specifically searching for a grcMip of N 
"on cells" each being within a spatial distance d of at least one other 
member of the cluster. The algorithm detects sudi a pattern In two 
stages. 

The first stage Is called the histogrmn stage. Here a (tedicated 
hardware processor looks at a pattern block ami (teterm1r»s If any pattern 
class has an "excess" of members within the block. Classes having a large 
excess will then be selected for stage II analysis. Furthermore, an 
extren«ly high excess will alert the central canputer. 

In the second stage analysis, a dedicated softeare processor first 
"cleans" the pattern block of all pattern points not In the class of 
Interest. The result of this cleaning procedure Is a 2-d1mens1onal binary 
representation as shown In Figure 3.13. The binary representation, whose 
constituents are also called "cells". Is then combed through and a list 
is kept In the software processors memory of all closest pairs of "on" cells. 

These pairs are deemed to be the likeliest centers of clustering, and are 
used as seeds around which clusters may be recognized. The algorithm that Is 
then performed Is a variant of the single linkage algorithm (Andenburg, 1973, 
and Ton and Gonzales, 1974), which collects adjacent points, that Is, points 
within a predetermined distance d. Into clusters. If the processor finds 
Itself able to extend the cluster spatially In a particular direction or In 
all directions such that a highly significant mmfcer of points are amassed, 
then a cluster has been discovered. At this point, the central computer Is 
alerted and the data pertinent to the cluster, which Includes both the raw 
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data and the cell representation. Is archived and available for display* 

S1mu1at1oi» Mre conducted w1^ high-level language software that were very 
effective. Thee are shown In Figures 3.14 - 3.16. 

The great advantage of t^ls algorithm Is that It Is ^1e to search for 
structure across any entire pattern block, which Is effectively a 6.4 kHz by 
1000 secwid record. The compactlfi cation scheme has bought us the real-time 
ability to do such extensive spatial searches. This capability Is far In excess of 
what Is possible through the use of dedicated processing of an equivalent 
8 billion pixel Image. In adr*H1on the dimensions of phase, polarization, 
and several types of coherence (temporal, spatial, phase) are Included In 
the pattern point compaction scheme. 
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(a) Befora 



Figure 3.14 - A clustar is dattctad and displayad by highlighting calls that hava 12 or mora 
neighbors ^rithin a distanoa of 2 units. 



(a) Bafora 

Figura 3.15 — In this dansaly populatad pattarn class |1. *.*.*•*•*• 
//as discovarad by tasting for calls that ara complataly surroundad 
neighbors within a distanoa of 1 unit. 



Il 


OP.ICINAL* PACe 

black and white photograph 



Figuri 3.16 — To iiolat* a ttnuous faatura. «va first raquira that a call havt at laast 2 adjacant 
naighboa to lurviva highlighting (b). than in a saoond pan (c) wa raquira that tha survivors 
hava thraa naighboring survivore within a distanca of 2 units. 
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Chapter 4 

Oasis System Architecture 
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The centipede was happy quite. 

Until the Toad in fun 

Said, ••Pray which leg goes after which?" 

^id woried her mind to such a pitch. 

She lay distracted in the ditch 
Considering how to run. 

— Hrs, Edmund Craster 


4.1 - Introduction 
Purpose of the System 

The purpose of the Oasis Signal Detector is to recognize signals in noise 
and to identify them in so far as is possible. In addition, the system: 

- Archives interesting data. 

> Allows the operator to become involved in decision-making and to 
introduce signal Identification tests of his own. 

- Provides the operator with visual and audio feedback. 

- Oversees the operation of the entire receiving system, including 
the sequence of events, such as changing to a new target. 

The system has been described in the previous chapter as a set of algorithms 
(Pulse Detector, Carrier Wave Detector, Numerical Battery of Independent Tests, 
Cluster Detector). These algorithms comprise the Real-time Processing System; 
there are, in addition, three other systems in the detector: the Total Obser- 
vation Archive, the Operator Interface, and the Radio Astronomy System, as shown 
in Figure 4.1. Now we will describe the digital hardware implementation of 
these systems. First there will be an overview of the architecture and data 
flow within the entire detector, followed by a more detailed examination of the 
processors, with schematics presented wherever necessary. The chapter ends with 
a summary of the speed, capacity, and cost of the planned equipment. 

The system employs both conventional digitial computers and special 
purpose digital processors to implement the algorithms. The reasons for the 
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parttcular choice of technology is discussed in detail in Appendix A4.1. 

While the selection of algorithm was somewhat constrained by the availability 
of compatible technology* the hardware feasibility stu<iy which accanpanied 
this design project had its largest Impact in placing limitations on the 
sizes of the random access memories avail^le to each processor. 

Architectural Strategy 

The strategy and ph11osq)hy underlying the hardware design which follows 
can be stated rather simply. If proven off-the-shelf components and pro- 
cessors exist to do a task, we use them. Otherwise, as In 1mplen«nt1ng 
several of the computational algorithms, we have found It reasonably straight- 
forward to design custom-made processors, all of which may be constructed simply 
using current or near-current technology. 

When the data rates and computation rates exceed that which Is possible 
with conwntlonal Integrated circuit technology, we use multiple processing, 
and/or parallel processing, rather than resorting to higher speed processing 
(e.g., greater than lo NHz), and thus avoid limnature and somewhat Incompa- 
tible Integrated circuit families. 

Fortunately, the very nature of the MCSA's output scheme (8 megawords /sec) 
allows all special purpose processors located downstream of this data flow to 
operate with a fundamental clock frequency of 8 MHz or 10 MHz,^ which Is 


1 The reason for 10 Wz Is due to the redundant processing of overlapped 
data as discussed In the discussion of Figure 3.9; detail. 1. 
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within the range of TTL technology. Still, there are cases where the number 
of computations to be performed at this rate is very large, and In these cases 
parallel processing Is employed, either with special purpose hard-wired 
devices (e.g.. In the CWD) or else using a set of microprocessors (e.g., as In 
the CD). All processors as well as all the major subunits of the Oasis Signal 
Detector make extensive use of the tediniques of pipelining and double buffering 
("ping-ponging"). Each stage of the pipeline Introduces a delay In the detection 
proMSS, but each stage Is allowed to operate at 8 MHz or 10 MHz. In some 
stages of the pipeline the data Is reduced or "compacted" as well. By this means. 
In the second stage of the Realtime Processing System, the data rates become much 
more manageable. Here, data compaction by a factor of over one thousand (via the 
MBIT routine) reduces data rates to 12.5 kHz and smal ler words , allowing a sen- 
sitive, non-determlnistic search for patterns to take place by way of the Cluster 
Detecting firmware routires of a microprocessor. 

Large Memories and "Second Looks" at the Data 

An Important question, and one studied In depth by the group, was the 
feasibility of maintaining a large online memory containing all of the raw 
data from a target (.1 to .5 terabit). We determined that to store such a 
data base was possible, yet such a data base could not be designed to afford 
convenient random access to the data for preliminary processing. While It would 
have been desirable from both an architectural and algorithmic point of view to 
have a large random access data base through which all processors could comb, 
backtrack, and focus In or out of, such an entity was not found feasible. 

Magnetic disc technology has not reached the storage density or capacity. 
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magnetic tape has not readied the required access rate, and erasable 
digital optical storage Is yet Immature and has not solved error pate 
problems at high densities, nor are ar\y of these coretraints expected to be 
sufficiently lifted within the next five years. 

We did, however, design a large memory In the form of a "Super High Bit 
Recorder Helical Tape Transport", which serves, not as a primary data base, 
but as an archive. In both a short term and long term sense. The data taken 
from every target Is stored temporarily In this "entire observation archive" 
while the dedicated processors (PD, CUD, NBIT) concurrently run deterministic 
tests on an Identical stream of data. The tests are considered deterministic 
In the sense that these tests simply act and report their findings over and 
over again; never altering their behavior In response to their own results. 

After this deterministic search has transpired for a particular target and ar\y 
outstanding results of the tests have been gathered, a "second look" Is taken 
at the data, using the results of the PD, CWD and NBIT processors as a guide 
to examining the archive. This Is shown In Figure 4.2. 

The concept of taking a close second look at promising portions of the 
data after an Initial evaluation has been made Is a natural and convenient 
solution to the large memory problem state above. In this wey, all of the 
data Is made available to powerful, sensitive, and non- deterministic tests, 
such as are provided by the clustering algorithm and the operator's own banki 
of knowledge, skills and tools. Nor Is the concept of a second look new. It 
has often been suggested by those working In the field, 2 
and It emerged independently as well from the Ideas of this design team. 

The data compaction of .1 terabits to .1 gigabits allows the system to 
automatically take a second look at the data. The second look comes In two 

2seeger,Stull, 1979, personal conmunl cation with menbers of the summer study. 

-114- 



First It 

1 

I 

Second 1 




ftealtlme Processing System. 
Processors take a first look at 
all the data, then a second look 
at the compacted data for patterns. 


Operator has all of the raw data 
at his disposal. A seccmd look Is 
taken based on the reports of 
Realtime Processing System. 


Figure 4.2 - Use of the "Second LocA" strategy, with both machine and 
operator getting a second look at the data. The operator 
is used as the ultimate pattern recognizer. 
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foms; the hlstogrwnlng and the adjacency seard^ algorithms of the Cluster 
Detector. 

The hlstogranming tests each 9 bit "pattern point" coming from the NBIT 
detector at a 12.5 KHz rate for membership In any of 512 "pattern classes". 
It can be seen that by compressing the data here we have gained the ability 
to subject It to many more tests. 

The operator Is also given a second look at the data. As will be 
discussed In the next chapter* It Is felt that the operator can and should 
play an Integral role In signal detection since he possesses the most 
advanced and sophisticated algorithms for pattern recognition possible. The 
previewing of data by the hardware processors rather than the operator Is 
necessitated by the fact that the raw data. If presented on a 22-1nch-tall 
standard display, would require a display 43 miles In length. 

Instead, the operator is able to concentrate on alarms from, and possible 
patterns detected by the real-time processing system. To aid him in his task, 
he is provided with exclusive use of three archives, an advanced image pro- 
cessing unit, and a bank of software data reduction routines, as well as aid 
from a radio frequency Interference detector and a knowledgeable radio astro- 
nomer. At the end of the 16-m1nute period, as the system moves on to observe 
the next star, a decision Is made, either by the operator, or by the central 
computer, to save or erase the prior observation. The operator's role as a 
signal detector Is shown in Figure 4.2. Comments on the criteria for the 
preservation for an observation will be given in the next diapter. 


3 

Assumes each 22"x22" area displays 512x512 pixels of 16 levels, and one 
observation consists of 8 10* Hz x 1000 seconds x 2 polarization x (1 real 
+ 1 imaginary part) x 16 levels. 


- 116 - 


AdvantMti of tht Archlfcturt Choitn 

Tha tystam Is highly modular, with littia or no coamunl cation taking 
placa batwaan procMtors sharing tha sami staga In tha pipallna. Input to 
proeass(nrs arrivas at a fixad rata on a common bus. and output Is typically 
limited to tha signaling of an alarm throu^ a priority Interrupt to tha 
central computer. Tha computer acquires Identifying Information regarding 
such a procassmr's alarm by reading tha output register of tha processor 
without establishing any two-way coawninl cation. 

Thera Is great advantage In keeping different functions physically 
separate and communIcatICMrts between mockiles minimal. In addition to providing 
a substantial Increase In computation power through parallel processing, this 
scheme allows for easy modification of the system, through addition or removal 
of processors. Processors may be easily monitored for failure, and the failure 
of one will not bring down the entire system. The central conputer simply 
removes the processor from the priority Interrupt system, and. If necessary, 
terminates the pipeline at this processor. 

Such modularity provides for a great deal of options In the Inq) lamentation 
of the Signal Detector. In Figure 4.3 we show the bttic pipeline, each level 
of which is a set of signal detectors. One may start at the bottom, throwing 
away modules without affecting the performance of the mockiles on the left, 
right, or above. At each turn, one Is left with a functioning signal detector, 
albeit one with reduced sermltlvlty to some class of signals. 

It should be noted that the most fundamental algorithms are located In the 
first two lines of this figure, namely the Pulse. Carrier Nave, and the nine 
NBIT algorithms. Thus, a minimal system configuration would still contain the 
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Figure 4.3 - A Delineation of the Pipeline of the Oasis Signal 
Detector, ^todules depend only on those directly 
above. Each module excluding Array Transposer 
is a signal detector. 
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most sensitive, and 1in>ortant algorithms. Further, any of the particular 
MBIT algorithms may be replaced by any alternative bit-setting algorithm, 
with the entire system above and below this stage of the pipeline remaining 
unaffected. 

An advantage of the system architecture of a mere primary nature Is the 
fact that all components to be described In this chapter are proven, off- 
the-shelf Items. Though some of the hardware Is state-of-the-art within Its 
class, namely the high density discs, helical tape transport, and display 
processor, as well as the Integrated circuits for particular integer nwltlplles, 
the classes themselves are mature In the sense of standardization, rell^lllty, 
economics, and the very fact that corporations suppor*t1ng such products are 
well-established, and reasonably long-lived. Appendices A4.1 and A4.2 deal 
further with the choice of technology, and alternative ways to Implement the 
(KD modules based, for example, on smaller or larger MCS/te, and the use of 
fewer modules. 

4.2 - The Real-Time Processing System 
Input to the Oasis Signal Detector 

The MCSA's bottom stage, consisting of 140 x 56 » 7840 BPF units, each 
with associated RAM memories, provides the input data to the signal detector. 

As can be seen in Figure 4.4, this stage produces, at each RW1 memory, 1024 
output words, each representing a channel of width of approximately 1 Hz. With 
7840 BPFs of 1024 ''channels'' each, there are, in total, 8 million outputs. 

These outputs are produced each second, so that an 8 MHz bandpass Input to the 
>CSA may be totally analyzed to 1 Hz resolution and bussed to the OSD In one 
second. 
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Sampling: 10 Wz Quadrature 0vers«np11ng 


140- Channel 
Band Pass F11t< 


56-Channel Band 
Pass Filter Module 


#140 ^ 

BPF Module 


32-band 
DFT Module 


#56 

X'#l\ 
1 1 

^^#56 

DFT 

DFT 

DR 


2 3 ^ 

il/ • • • vL 


1 2 22 
JaI/ • • • \I/ 


|1 2 32 

vl/vjy • • • J/ 



1024 1 2 


1024 12 


Figure 4.4 - FCSA Architectural Block Diagram, with each Band Pass 


Filters 



During one of these time Intervals, all the RAHs fill In parallel. The 
results of the Prime Factor Algorithm (Narasimha, Peterson, ^nd Narayan, 1978)^ 
leaves the data skewed across the RAM In a fashion simllat to the results of an 
FFT operation (Bergland, 1969). By providing two RAMs at such stages, one 
RM1 may be written Into In a skewed fashion, while from the otiier, data Is read 
out from the previous time Interval In a sorted manner. Upon leaving the final 
DFTs, the data enters the Signal Detector. Exp1an(ted block diagrams of the Q&D 
showing all major components with associated data rates and memory capacities 
are shown In Figures 4.5 and 4.6. 

The data Is read out at 8 Hwords per second, with each word consisting of 
16 bits real and 16 bits Imaginary, ana Is clipped after readout to 4 bits real 
and 4 bits imaginary, then multiplexed (box l^eled MUX In Figure 4.5) with 
the output of the second Identical NCSA operating at an orthogonal polarization. 
This yields a composite 16-b1t word consisting of 4 real bits (polz. X ), 4 bits 
Imaginary (polz. X ), 4 bits real (polz. Y ), 4 bits Imaginary (polz. Y ). As 
discussed In Chapter 3, each quantization level of the new 4-b1t values repre- 
sents .3o of the average signal ampl1tu(te. The clipping Is accomplished with 
an 8-b1t to 4-b1t ROM look up table. 

Array Transposer 

The output of the multiplexer, the 8 Mvords/sec data flow branches In 
two directions. A scan (a one-second observation from the MCSA, 8 million 
16-b1t composite numbers) read out of the multiplexer Is bussed word by word to 
the pulse detector, which performs Its computations Immediately and directly 
on each scan. The scans are also written Into a device known as the drray 
transposer. 

references Chapter 3 
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Figure 4.5 Oasis Signal Detector Block Diagram With 
Rate of Data Flow Between Components 
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The need for this device Is dictated solely by the input requirements 
of the CKO. The CMD operates on nearly vertically arranged slices of raw 
data, as was discussed in Chapter 3, and illustrated in Figure 3.8. Each 
vertical slice covers 20 spectral scans, and therefore pr^ents an unusual 
memory requirement. 

The transpose method involves reading twenty seccxids of NCSA output 
into rt^mory one scan at a time, i.e. , r(»i by row. (hiring the next twenty* 
second interval the data is read out one frequency (one column) at a time, 
i.e., colimm by column. This form of read out is ideally suited to the CWD 
algorithm and requires a very small amount of intermediate memory before 
the result is accumulated for each slice. 

The transpose method consists of a double buffer, one part of which is 
reading data in from the NCSA output and the other part of which is writing 
out to the CWO implementation. Read-in of data is done in a conventional 
manner, where one scan is read in at a time, so that the data of wie scan is 
arranged contiguously around one set of tracks (one cylinder) on the magnetic 
disc. 

The transposer is composed of two disc systems and two RAM systems. The 
identical disc systems are composed of two discs and one controller. The RAM 
systems are two identical 20 x 10® bit RAMs. One disc system reads in from 
the NCSA output while the other writes out to the RAM system. This procedure 
continues for 20 seconds after which the two systems trade functions, with 
the disc which was reading now writing and vice versa. The RAMs operate on a 
similar basis but on much shorter time scales. Each 0.13 seccxuk the RAMs 
trade functions, one reading from the disc while the other RAM is writing to 
the CWD and fffilT algorithms. 
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This double-buffering of systems allon^ for the transposing of twenty 

scans from row-by-row access to column-by-column access In the following 
manner. Data Is written from one of the disc systems to one of the RAMs In 
partial rows. These are 64 x 10* Hz strips of an NCSA output scan (1/130^^ of 
a scan). Within the disc system that Is writing out« there exist two disc 
drives and a ccmtroller. One disc reads out a partial scan to the RAM while 
the other disc seeks out another partial scan, positioning Its read-head 
appropriately. The amount of time required to perform this "seek” Is equal 
to the amount of time required to write out a partial scan. It Is In this 
way that time Is bought to fill one of the comparatively small RAMs with a 
64 X 10* Hz X 20 second piece of the total 20 seconds of data. Once the data 
has filled the RAM, the RAM switches to the write mode and the data Is 
written out, twenty-element column by twenty -element column. This constitutes 
the transposed data flow. 

In the read and write modes described above, all data transfer rates are 
8 X 10® words/sec., where a word consists of 16 bits, 4 bits real and 4 bits 
Imaginary for each polarization. It should be noted that the data rate out of 
the NCSA Is maintained at all times. The net effect of the dlsc-double-buffenng 
Is to deley the "real-time" arrival of the data by 20 seconds as well as to 
transpose it. 

Advantages of the Transposition Scheme 

The transposition scheme Is necessitated by the requirements of the CWO 
algorithm. There are two additional and very Important benefits of the trans- 
posing scheme. The first Is the block-by-block transmission of the rm data 
to the NBIT processors. With this transmission scheme, the 800 raw data words 
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that are used to produce one bit of the 9-b1t number which represents a block 
all arrive sequentially. Hence, none of the partial results of an NBIT test 
on a particular block need be saved after that block Is processed. 

It should be recognized that there Is an alternative processing scheme 
for the fffllT algorithms that Is eliminated by using the array transposer. 

This scheme Is called the "on-the-fly" 1mplen«ntat1on. It consists of 
taking the raw data from the NCSA, one scan at a time, and computing the 
results for all pattern blocks In parallel. This on-the-fly scheme would 
require accimulators for each block during the processing of twenty scans. 
There are 2.5 x 10® blocks being parallel® processed and the amount of 
acciRiNjlator memory required would be approximately 3 x 10® bits. This Is not 
an unreasonable amount of memory, but nonetheless the need for Its presence 
Is con^letely eliminated by the transpose system. 

Pulse Detector Implementation 

The algorithm implemented here, as described in Chapter 3, is an 
examination of the incoming scans under various resolutions ranging from 
1 Hz, 2 Hz, 4 Hz, ... , 2** Hz = 8 MHz. This is accomplished in the 
following manner. First the incoming signal is converted tv*> a total power 
value by sunning the squares of 4 complex components. The conipnnents used in 
this operation are "lookup" ROMs for squaring and MSI integer adders. Then 
24 such adders integrate the data into 24 accumulators. At the end of 2^"^ 
Integrations, the i^ accumulator is compared against a threshold. If there 

5 "Parallel" in the sense that the processor would work on each block only 
1/^ of the total amount required to complete that block, and then move 
on to the next block. Only after 20 visits to each block Is this 
parallel computation completed. 
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Is a significant excess of total power, the tiireshold Is exceeded, and 
an alarm Is sent to the central computer. 

This hardware scheme Is shown In Figure 4.7. The clock sdteme for the 
adders Is simply an 8 HHz oscillator synchrozined with the NCSA readout, and 
this clock drives a 24-b1t counter. Each bit of the counter Is an enable 
signal (e.g., rising edge due to 0 1 transition) to the thresholding 

function of a unit. All adders add upon each cycle of the clock, and, at 
each 2^”^ cycle, the 1^ bit of the counter experiences a rising edge, and 
allows a threshold and accumulator clear to take place. The expected total 
power from 2^ summed channels Is a constant due to the clipping procedure, 
and thus the threshold values are likewise constants, hardwired Into digital 
comparison chips which check for "greater than" situations. 

When such a situation exists In any of the 24 units, an or-ing procedure 
allows a single alarm (one-shot Interrupt pulse) to be sent to the central 
computer's priority Interrupt system. The unit experiencing the excess Inte- 
grated power latches Its own identifying number "1" into a register connected 
to the computer's 10 Interface. An additional pair of registers, the 24-b1t 
counter already discussed, and a scan number counter are also similarly 
connected to the computer, and these three registers serve to uniquely describe 
the location of the anomaly In the ravt data, for the operator Interface system. 

The (tetector, as described above, performs the coarse resolution pulse 
search on one scan at a time. The detector includes, as well, a second, 
i<tentical unit that performs this test on an integrated scan (see Chapter 3). 

This Integrated scan is accumulated at an 8 MHz rate into a large RAM memory 
16 bits X 8 X 10* bits.® At the end of 1000 seconds (1000 scans) this 

6 While RAMs of this size are not manufactured per se. In a serial access 
utilization such as above, there Is little difficulty In stringing 
together many 256K RAMs, without encountering additional addressing problems. 
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Figure 4.7 - Pulse Detector 
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Integrated scan Is read Into an Identical pulse detection unit (with appro- 
priate thresholds), and a separate alarm, with the assi^lated registers (less 
the scan ID register), may be communicated to the con^uter. 

A Hardware Implementation of the Carrier Wave Detector 

As discussed In Chapter 3, this processor Is respwslble for the detection 
of negatively and positively drifting carrier waves (drifting In frequency) of 
driftrate between -1 and +1 Hz/second, and of any Initial frequency within the 
8 MHz band. The detector Integrates for 1000 seconds of data, within which 
50 "slabs" of data (measuring 20 seconds by 8 MHz) are Individually examined 
for carriers, as Is the grand total of the 1000-second observation. See 
Figure 4.8. 

Through the use of the Generalized Coherence measure, the algorithm provides 
an enhanced sereltivlty to drifting carrier waves. By responding to the phase, 
polarization, and amplitude or coherence of a signal occupying a moving 
frequency bln (called a "slice"), the detector Improves Its signal to noise 
ratio by a factor of ~6 over what Is accomplished through simply summing the 
power within t^le slice. 

The processor also searches for "carriers" of arbitrary bandwidth. One may 
think of this either as the search for several nearly adjacent drifting carriers, 
or the search for spectral features of any width (up to 8 MHz) which retain 
their spectral shape and phase, but with a frequency offset In time. The 
detection of such drifting signals Is accomplished by binning the output of 
the carrier wave detector across the axis of the starting frequency positions 
of the slices. 
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IiwDlanentation 

The detection of the Generalized Coherence of carriers presents the most 
difficult processing requirement of the entire OSO, where as many as 200 complex 
Integer multiplies must be done In parallel at an 8 Mlz rate, a rate 
far In excess of what Is required of aiv other processor In the system. 

One further ramification of the Implementatlw of this algorithm Is the 
necessity of providing a "horizontal" data flow to the processor as pictured 
In Figure 4.8. This horizontal flow of data, where data closely associated In 
time and frequency enter the processor together Is accon^llshed by the arrqy 
transposer. 

Given the somewhat formidable equat1(xi for the Generalized Coherence 
measure of a drifting carrier wave (discussed in detail In Appendix A3. 10 and 
shown on page 136 , a con^lete separation was made of 

serial operations applied to each slice of data (with a specified Initial 
frequency In the band) and these operations were arranged In a long pipeline 
with branches. All data necessary to compute one GC value enters the pipe- 
line at an 8 Miz rate In the horizontal manner pictured, and Is moved from 
stage to stage at this same rate. 

Since all of the data within a slab (8 MHz x 20 seconds) originates 
at an 8 MHz rate, all the resulting GC values (8 Miz Initial positions x 20 
slices) emerge from the pipeline In real time. While the workspace of each 
stage Is relatively small (less than 100 words) RAM buffers are considered 
the optimal storage medium for their simplicity. The pipeline, with RAM 
doiAle buffers Is shown In Figure 4.9. 
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Alarm (l,2,or3) 




Figure 4.9 - The 14 Stages of the CWD processor use double-buffered RAMs to 
pass the data from stage to stage. The outputed GC values are 
1) compared with a threshold, 2) binned within a slab and compared 
to a threshold, and 3) integrated and binned with all preceding 
slabs and compared with a threshold. 
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The addressing scheme for reading In the appropriate rm data values for 
a particular slice, as well as the transfer of data from the workspace of one 
stage to the next Is provided by controllers. The first stage controller 
also maps-ln the correct weighting factors for the slice being processed. The 
pipeline sequentially processes 20 slices for en Initial frequency and then 
begins on the next Initial frequency. Therefore the output of the last stage 
an>ears as sequential columns of the slice (vertical) by Initial frequency 
(horizontal) array shown In Figure 4.9. As this array appears, It Is accumu- 
lated onto a 1.3 Gbit disc memory so as to Improve the signal to noise ratio 
of carriers that drift across several. If not all slabs. Some minor address 
computation Is performed for this accumulation process, via the disc (programmable) 
controller. The controller displaces the data from a particular slice such that, 
for example, the GCV of a slice drifting from 1 to 20 Hz In the first slab Is 
added to the 21 to 40 Hz slice result of the second slab. 

An Integration of this type Is performed on each Incoming slab, so that we 
have available at all times the most recent slab, and the simt of all slabs to 
date. This leads to the three tests: 1) the thresholding of each GC value from 
a slab (160 million values), 2) the coarse binning over 24 different resolutions 
of a slab, and 3) the coarse binning of the sum of slabs. The coarse binning 
circuit, as suggested by Figure 4.9, is simply the circuit used In pulse 
detection (Figure 4.7), only now the Ir^ut to the circuit Is the output array 
of the CWD algorithm. The result of binning the accumulated output array Is 
shown In Figure 4.10. It should be stressed that the exwilnatlon of a single 
slab output over 24 resolutions requires no memory, and, that this test alone 
constitutes a very sensitive search for drifting signals. Assigning the carrier Is 
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Figure 4.10 - Detection of a broadband, coherent, drifting signal via CWD 
integration and binnirg is shown. 



observable for 1000 seconds, the 1.3 Gbit disc allows Integratlcw) which 
serves to Increase the SNR by a factor of 7. For this reason, the 
(^tlonal OUO Integration module was placed on the system pipeline (Figure 4.3) 
below the detector Itself. 

The CWD Pipeline 

The following schematics (Figures 4.12 - 4.16) show the stages In the 
pipeline, along with a guide (Figure 4.11) delineating how the branches of the 
pipelines fork and join. Where there are 20 Identical functions occurring 
In parallel, the diagrams show the outline of 20 boards In a stack. The 
reason for 20-fold multiplicity Is that a single slice traverses 20 scans (rows) 
of data and the computations performed on the r»/ data In these rows mey be 
performed 1r. parallel. At some points In the pipeline there are operations 
which require more than one 8 Wiz cycle. These operatiws ther span more 
than one stage of the pipeline so as to be allotted the required time (e.g. 
stage A7 - A8 Is a 5-stage, 16-b1t, cascacte full adder). Also, at some points, 
data Is produced by one stage that Is not needed until many stages down U)e 
line. Such data Is passed along the series of RAM workspaces but Is shown In 
the schematic passing through registers marted "H" for hold. 

Stages 1 & 2 - The Incoming MCSA data are first weighted In accordance 
with the rqy angle, the signal distribution model and position In the array. 
These factors are predetermined and are reflected In the weighting values 
stored In the PROM look-up table, designated by a. Since a ray may cross 
three frequency bins, three weighting values are multiplied by the three 
potential bln contributions. 
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Stages 3-14 - These stages compute the various factors which comprise a 
6CV which is defined as: 


GCV = 


2m 

wirmr 
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(computed at B6-B9) 


and 


m = 




(computed in C4-C13). 


It is pcKSible, therefore, to compute each of the summations separately, multiply 
them together and add the results together at the end. The kj and k .2 components 
are derived in Stages A3 - A9, m is computed in parallel C4 - Cll, the factor 
— — IS calculated in C12. Meanwhile, the other five correlation 
summations are computed in a parallel set of operations in Stages A3 - A9. 

The results are confined in Stages 10 - 14. 
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Notes on Stages 1 & 2 

Fo’^ each 0. there are three complex (4 bit real, 4 bit imaginary) weighting 
factors which include an area conponent and a phase correction component. 

The input values are held in the RAM buffers for ?.5 ysec. During that time 
the ray values for each of the 20 slopes are weighted. One angle is started 
each 125 nsec. 

Not shown is the table wmch contains the displacements from the frequency 
10. , 20. which is used to update the address register for the correct ray 


fa^^or^ 


The displacement is a function of i and 


Figure 4.12 - Beginning of CWD Pipeline 
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COW»LETE COMPUTATION FOR 6VC, 


PyA. 



Convert fraction, then use a multiplier and a shift, 

e>9> N * 20 * ,0526 and g j j -• •0527. 

Figure 4.17 
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Finally, we examine the actual components of the pipeline. We reduce the 
complex mathematical symbols shown in the preceding diagrams to real Integer 
adders and multipliers. The key to the symbols is as follows: 

a complex function 

2 2 

a power computation, e.g. x + y 
a multiply and addition operation respectively 

the circuit involves 4 stages 

and retains words of 16 bits 

Hold register 

16-b1t bus 

change sign bit 

a ROfl table lookup of the stored function (e.g. 745189) 
The means of performing 8-bit multiplication at 8 MHz rates is easily 
commercially available (e.g., TRWMPY8HJ circuit, maximum propagation time 65 ns). 
To do additions at this rate is also no prctolem (e.g., TTL 745182, maximum 
propagation time 36 ns). The 16-bit multiplication units canmercially available 
in 1979 are, in the worst case, a little below the required 8 MHz rate (140 ns 
prog, time = 7.1 MHz), but if this does not improve, there are many alternative 
means of reaching this rate including multiplexing, or ROM lookups. The approach 
favored most by the design tewi is to have custom MSI complex mathematical 
circuits designed by industry, of the functions shown in Figure 4.18 and 4.19. 

This should be the most economical solution, and such circuits, being of a 
fundamental nature, may have commercial appeal in their own right. 
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8-BIT COMPLEX MULTIPLIER 8+8j 
IN; A « Rj + jlj . B « Rg + jig 

OUT; Re(A*B), Im(A‘B) 

MAX RATE: 15 MHz 

MAX DELAY; 100 nsec 




16-BIT COMPLEX POWER FUNCTION 


IN; one 16-»>16j number 
OUT: 16-bit sum of squares 

MAX RATE; 20 MHz 
MAX DELAY; 55 nsec 




COMPLEX ADDER 


IN; 20 (16+16j) nuntiers 
OUT : one 16+16J 
MAX RATE: 20 MHz 

MAX DELAY: 175 nsec 



(Identical unit 
for 

Imaginary part) 
Imaginary 


Figure 4,18 - Some of the components used in the proceeding schematics. 
The values for the rates and delays are derived from 
current off-the-shelf adders and multipliers. 
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SCALAR MULTIPLIER 

IN: one 16-fl6j and one 16 bit real 
OUT: one 16+16J 
MAX RATE: 15 Wz 
MAX DELAY: 140 nsec 




TWO STAGE 16-BIT COMPLEX ADDER 

IN: three 16+16j numbers 

OUT: one 16+16 j number 

MAX RATE: 20 MHz 
MAX DELAY: 70 nsec 



Real 


Imaginary 

Drop LSB 
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lnglementation of the NBIT Algorithms 

The Implementation of this part of the OSD amounts to the Implementation 
of 9 separate algorithms. It Inclutites, as well, a pre*pro^ssor that organizes 
and prepares the data streams which branch out to the algorithms. Thus* 
before the data enters the Individual processors* It passes through the Overlap 
System and the Common Processor as shown In Figure 4.20. By the time the MCSA 
output arrives at the Common Processor* Its ordering has changed several times. 
It emerges from the MCSA scan by scan* from the transpose RAM slab by sld), 
accessed a column at a time* and finally from the Overlap System block by block* 
accessed a row at a time. (These terns were defined in section 3.5) The 
trend here Is toward preparing the data to be passed to the algorithms 
more and more closely associated In the two dimensions of time and frequency, 
not just In one d1 irons Ion of frequency. 

The blocks contain redundant data. Since blocks overlap 8 Hz on each 
side with the adjacent blocks, the overall throughput rate is Increased to 
10 MHz In the system to maintain realtime processing. This Is accomplished by 
the Overlap System, ^text, since many of the NBIT algorithms require the same 
sort Input, such as spectral power values, the Common Processor prepares these 
values at once for maqy algorithms. Each algorithm takes what Is needed from 
the Common Processor* and then carries through the operation with Its own 
particular hardware configuration. 

Before describing the Overlap System, Common Processor, and the algorithm 
Implementations In detail, we will first discuss the role of the NBIT system 
In the context of the entire system. 

It Is the NBIT system which compacts the data by a factor of 1000, and 
reduces the throughput rate from megahertz to kilohertz. It is also the 
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Figure 4.20 - f®IT Detector - Block Diagran 
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system that surveys the data with high sensitivity, noting millions of 
candidate 1ow>1eve1 signal features, and, rather than bombarding the operator 
and central computer with these. Incorporates them Into the compacted arrey 
for further scrutiny* Finally, It Is a collection of very diverse detection 
algorithms, a Battery of Independent Tests, which are used together In a 
"shotgun" approach, with the thought that an arbitrary signal type will be 
caught by one or more of these tests. 

The NBIT detectors are constructed so as to be able to contact the central 
processor directly (through an Interrtipt "alarm") If a block of data passes 
a particular test with a highly significant score. This direct line to the 
central computer Is a common feature In the Oasis System. Each processor has 
one, and they are designated by triangles in Figure 4.5. This symbol next 
to the NBIT system Indicates there Is avall^le to each of the nine processors 
a high threshold alarm channel to the central computer, (The thresholds for 
these high level alarms, and the entire strategy regarding false alarm rates 
and archiving, are discussed In the next chapter).) 

The term "low threshold alarm" In Figure 4.20 refers to the fact that each 
block of 800 numbers is reduced by an NBIT algorithm to a single bit based on 
a COTiparison with a threshold. In order to provide the maximum in sensitivity, 
as explained in Chapter 3, this bit is set whenever the test score for a block 
is greater than the mean value resulting fran a block of noise alone. (Thus, 
in the absence of signal, the low threshold alarm is triggered by noise half 
the time.) The collection of low threshold alarms then is not transmitted to 
the operator, nor to the central computer, but becomes the compacted array, 
and is stored in Pattern Block Replay section of the Cluster Detector. 
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since each of the Individual test processors obtains a block (800 numbers) 
of data at a 10 HHz rate ( sped>up to offset the Inclusion of redundant data 
due to overlapping blocks), and each turns a block Into a single bit, the bits 
flow out of the processors at a 10 MH 2 / 8 OO = 12.5 kHz rate, into the Cluster 
Detector. 

The Overlap System, the Common Processor, and each of the NBIT processors 
are relatively simple and memoryless custom-made units, built from the same 
components as the CWO, though, for the most part, far fewer. In one test 
(Goodness of Fit) a pair of microprocessors are also used to provide floating 
point divisions which retain a large dynamic range. Otherwise, the same 
integer adders, multipliers and small PROM look up tables are employed. 

T)e schematics of the Generalized Coherence by Row, and the Degree of Polari- 
zation tests are found in Appendix A4.3, and are a much simpler version 
of that of the CWD processor which has already been presented. 

The implementations are memoryless in the following sense. With the 
array transposer system, we have contrived to arrange the data block by block, 
so that once a block has been tested, and the low and possibly high alarms are 
recorded, the working registers of an NBIT test may be completely cleared. 
Without the array transposer, the data would arrive scan by scan, and partial 
test values of the 250,000 blocks that comprise 20 scans would have to be 
stored in accumulators for each test. From an economic viewpoint we note that 
to provide these (RAM) accumulators would be cheaper then providing the array 
transpose system. But the transposer is included to make possible the detection 
of drifting carrier waves, and that it also allov-«s a memoryless NBIT system is 
simply a side benefit. 
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To allow Uie low threshold 9-b1t number (representing the results of 
the nine simultaneous tests on one block) to be syndironously transferred, some 
pipelining Is necessary. Those tests with several stages (e.g., 6VC by row) 
Incur a delay In the production of their bit, so that the other quicker tests 
must provide a pipeline of hold registers to match this ctelay. Such delay 
is on the order of milliseconds. 

The Overlap System 

As shown in Figure 4.21, data (In slabs, column by column) feed from the 
20 Mbit RAM to both the CWD and the overlap RAMs. At all times two of these 
RAMs are set for reading In at 8 MHz, and the third is writing out a 10 MHz. 
Each RAM holds 800 16-b1t words (40 Hz x 20 seconds). In order to overlap 
the blocks by 8 Hz on a side, the last 8 columns read into one of the 8 MHz 
read-in RW4S, are also the first 8 columns of data read into the other. 

When the first is full it becomes a 10 MHz read-out RAM, and the functions 
of each of the three RAMs are rotated. The 10 MHz readout is row by row. Not 
shown, but rather trivial to implement, are the details oi the three-phase 
counter (which counts once each 800 8 Wz cycles, and controls both the Read/ 
Write function of the RAMs, and the clocks selection to the RAMs. Also not 
shown is the addressing sequence (stored in PR(^) to read out the data in a 
different order than the sequential readin. 

The Common Processor Implementation 

This processor, shown in Figure 4.22, uses previously described components 
to prepare data streams for the NBIT algorithms. It also aids the "6VC by rows 
test" by preparing a delayed signal for computing the autocorrelation functions. 
All preparations are done at a 10 fWz rate in parallel as shown. 
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Oise System 



Figure 4.21 - Overlap System. Data is read in to the Overlap RAMs 
column by column at 8 MHz, and is then read out at 
10 MHz, with 25% of the data read out twice. 
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8-Hz Pulse Detector Implementation 

This detector (Figure 4.23) computes an 8-channe1 running sian across 
each row of a block. The total power is compared against a high and low 
threshold at a 10 MHz rate. Any occurrence of excess power will latch a "1" 

(low threshold) and will possibly send an interrupt (high threshold). A 
counter disables the shift register at the start of a new row when less than 
8 values are available for summing from the shift register. 

Total Power Implementation 

After 800 accumulations, the total power of the block is compared against 
the two thresholds (Figure 4.24). Although this is by far the simplest and 
cheapest detector of the entire Oasis system, it is considered of fundamental 
importance, as it is sensitive to constant power within a 40 Hz (medium-sized) 
band. 

Goodness of Fit Implementation 

Here the signal's real and imaginary amplitude components are checked for 
obeying Gaussian statistics. Since, for a 4-bit nurrt)er, there are 16 quantiza- 
tion levels, these values are decoded (Figure 4.25), and the nunter of times 
each quantization level occurs in a block is accumulated in ripple counters. 

The counting process can occur easily with conventional ripple counters at 
20 MHz rate, so that each such unit can handle the real and imaginary part of 
one polarization. 

At the end of the accumulation of the 800 values, a pair of microprocessors 
perform the Chi-Squared test determining the significance of the deviation of 
each level. Ripple counter "k" is initialized with -Tj^ so that the microprocessor 
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Shift Register {10 MHz) 



Figure 4.23 - 8-Hz (Narrowband) Pulse Detector 




Data is accumulated for 800 points (40 Hz x 20 sec block of data) then sent 
through a thresholding device. The threshold device (magnitude comparator) 
will decide if either a high-level or low-level alarm has been triggered. 


Figure 4.24 - Total Power Detector 







Figure 4,25 “ Goodness of Fit for one polarization. The other polarization is 
handled by an identical unit. 




need only perform a square and a divide for each k » 1, 16. With only 
(10 MHz)'^ X 800 * 80 psec to perform 16 such sequences, a look up table, 
combining the squaring and division by a constant operation into one read 
might be required to provide the sufficient speed. 

Complex Coherence Implementation 

The Common Processor provides this detector (Figure 4.26) with 
Re(XY*), Im(XY*), Pj^, Py, so that only accumulators and look up tables are 
required to easily implement this algorithm. 

ANOVA Implen^ntation 

The implementation of the ANOVA algorithm computes the equations listed 
in Table A3. 8.1. Ideally, the analysis of variance used in this study (a 
two-way ANOVA cross classification) requires a matrix of R rows and C columns 
in which bvo statistically independent sets of measurements of the same target 
are contained in each cell of the matrix. Because time is a factor in the 
analysis, to simultaneously obtain a second set of independent data would 
require a second antenna of the same polarization (not simply a second 
polarization). 

A suitable substitute for a set ol independent replicates has been shown 
to be a staggering of each second's output of the NCSA. The first second's 
scan fills the first half of the first row of power values, while the second 
second's provides the replicates. Similarly, the third and fourth seconds' 
scans comprise the second row of the matrix. Despite the displacement in 
time of these signals, simulations have shown this approach to be acceptably 
effective (see A3. 9, Simulation Studies). 
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As shown In the table, the quantities MSR, MSC, MSI and MSE 
require computation. It Is not difficult to expand these four summations 
so as to obtain: 


MSE = 1 (A - i E) 
rc n 


(D - F 

' nrfc-V) 

• nfr- l TCcl ) ^ t ^ 0 

where A, 8, C, D, and E are defined in Figure 4.27. By expressing the MS 
values in this way, the entire computation may be performed in the parallel 
pipelined manner shown. Here, we have also taken advantage of the dual 
availability of row and column outputs of each block provided by the MBIT 
common processor. The data flow rate in Figure 4.27 is divided down at each 
accumulation point indicated, while the bit width of the busses and accum- 
ulators is increased. 

At the conclusion of each block input, *;he values A, B, C, 0, and E are 
syncronously buffered from their respective accumulators into the micro- 
processor i;hich computes: 


e MSR . 5 C - 1/10 B^ 


. MSC _ 20 0 - 1/40 B^ 
C ‘ MSE ' 3? a: - 1/2 E 


c MSI 200 E 

'^i mst m" 


- 1/20C - 1/lOD 1/400 B^ 
A - 1/2E 


where we have substituted from above, setting r = 10, c * 40, and n = 2. 

A simple comparison circuit then compares the results against high and low 
thresholds, and, after oring with’the results of an identical circuit for 
the orthogonal polarization, deposits the three bits in the NBIT word. 
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Figure 4.27 ANOVA Implementation showing hardware for one polarization. 
(Second polarization implementation is identical.) 

















Cluster Detector Implementation 

The reduction In data rates (8 MMord/sec ■♦12.5 l^ord/sec) allows for 
sophisticated* non-determlnlst processing at low cost. We can now use the speed 
and versatility of a computer to comb through a condensed version of the entire 
observation. The histogramming (checking for excesses of low-threshold bits) 
takes place as the observation proceeds, and as each 9-b1t value Is output for 
NBIT. At the end of the observation the histograire are con^)leted and the 
cluster detection (via m1croprocesso»^roceeds. The significant results of this 
algorithm are transmitted to the Central Computer during the observation of the 
new target. 

The steps of the process are described In detail In Appendix A3. 11, and 
are summarized below. 

The Histogram Circuit 

This circuit prepares a "histogram" for each of the 1250 
pattern blocks comprising the entire array. The histogram is a nunter count 
of the 9-bit outputs of NBIT, according to occurrence of each output type 
("pattern class"). There are 512 such types, and these types are not mutually 
exclusive. Each output of IS IT is thus subjected to 512 tests, and the imple- 
mentation for these tests is shown in Figure 4.28. A 9-bit number appears for 
(12.5 kHz)"^ » 80 usee and 512 "mask tests" are conducted at 512 x 12.5 kHz * 
6.4 MHz rate. The test is for the presence of I's, with the other bits being 
"don't care" bits. The "don't care" bits are represented by O's in each mask, 
so that the Boolean operation is (ff^V M^)a (^ 2 '^ ^2^ ^ ^ ^^9^^ ^9^ 

resulting in True (1) whenever all I's In the mask M appear as I's in the 
NBIT nutT*>er N. The dock runs at 2 x 6.4 IWz * 12.8 MHz so as to give the 
histogram RAM both a read and write cycle for each accumulation. The histogram 
circuit, due to the nature of the input to the ICIT processor, (a row of blocks 
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the histogram for that pattern block. 








at a time) processes a row of all pattern block histograns at a time. Thus 
the address logic in Figure 4.28 performs a read and write by rows of conse- 
cutive pattern blocks. 

As the pattern points (NBIT numbers) arrive at the CD system, they are 
simultaneously stored in ?n additional memory shown in Figure 4.5 as the 
Pattern Block Replay. These 4 RAMs constitute an identical pair of dottle 
buffers. One pair is for the storage constantly taking place during an obser- 
vation, and access by the CD processor. The second pair is for storage and 
access, but this time, by the operator, who lodes at interesting pattern blocks 
found by the CD processor and Histogram Generator and stores these blocks on 
tape. 

Cluster Detector Microprocessor 

In this section of the CD system, control is transferred to a microprocessor. 
For simplicity, the system includes only one processor, but it will be shown here 
how, at this stage in the data flow, large gains can be bought at a very small 
price by including additional processors conducting nearly identical operations 
tc the one detailed. 

The tasks of the microprocessor are threefold: 

1) Accessing histogram results; sending high-level alarms, and 
using results to establish a potential pattern. 

2) "Cleaning" the pattern blocks tc be searched. 

3) Subjecting the pattern block to the adjacency algorithm. 

The first task above occurs once at the start of the detection process. The 
microprocessor has the list of expected histogram number counts that would be 
found from noise alone (shown in A3. 11). It notes pattern blocks possessing 
significant excesses. If ♦■he count is highly significant, a high-level alarm 
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Is sent to the Central Computer. Otherwise, the most significant 1000 excesses 
are noted In a list In the microprocessor's nefflory. (It retains 1000 "slots" 

In memory, and as the histogram contents are read out, the least significant 
values from either the slots, or from the current read-out Is discarded In a 
"quick sort" fashion). At the end of the histogram read-out, the microprocess 
has a list of 1(XK) pattern types, each related to a particular pattern block, 
that it will perform tasks #2 and #3 on. The sorting of the top 1000 such 
pairs should take: 

1250 histograms 

x512 bins per histogram 

x-lO slote to search through till minimian is found 

X 5u seconds per compare and 'I' card 

32 seconds 

On the average, only ^10 slots need be checked before a histogram bin value 
is discar(ted, because we are selecting only the most interesting 1000 pattern 
classes out of 1250 x 512 candidates, so that most histogram bins will quickly 
be determined to be uninteresting (non-significant). 

Since, as shown above, we have ised 32 seccmds out of 1000 seconds available, 
968 seconds remain to "clean" the blocks and find any patterns. 

The cleaning operation amounts to arranging a bit (cell) pattern in the 
microprocessor's memory that contains I's wherever the interesting (excessive) 
pattern class occurs within the pattern block, and a 0 where it does not. Thus, 
in the microprocessor's memory we retain the two dimensional position of the "on" 
cells. 

Such a progran was written in a high-level language (BASIC) simulation to 
produce Figures 3.13-16 and its low-level language equivalent might be: 
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ysec 


r BACK: LDM. PNTR. 1 

ORI. MASK, 1 

ISZ, PNTR. 1 

loop 10.000 times STI. PNTR. #0 

BRZ. PNTR. OUT 

1. »HOP. BACK 

OUT: 


get pattern point 2 
is it in class? 2 
yes, leave cell on 1 
no, off, go to next 2 
done?, leave 2 

1 


10 ysec total 


Using a mock-asseinbly code we have conservatively estimated the run>time of 
a microprocessor to clean an entire pattern block is 10.000 x 10 usee = .01 sec. 
Thus, after cleaning each of 1000 blocks we still are left with most of 1 second 
to seach each for patterns. 

As a good example of the firmware programming needed, we choose the most 
time-consuming (with half the cells on by noise) adjacency algorithm; one that 
checks and counts nearest cell neigM>ors for complete occupancy. 

A cell's survival to the next generation depends on it being totally 
surrounded. The surviving cells are counted, with a large count indicating the 
presence of a cluster. This would correspond, in the notation of A3. 11, to 
requiring 8 cells be on within a distance of 1 cell. 


Again, such a computer program was used to produce Figure 3.16. The mock- 


assen^ly language equivalent would be: 


usee 


j>START: 
10,000 times 
NEIGHBORS: 


T 


8 times 


DONE: 

-ALL DONE; 


RIGHT: 

UP: 


LDM, 

PNTR.l 

get cell 

2 


IBZ, 

PNTR, START 

cell empty? get next 

1 


CLR, 

I 

set up index 



TRR, 

PNTR,PNTR2 

get centered on cell 

A 


ADD, 

PNTR2,UP(I) + 

move up, down, or none 

1 


ADD, 

PNTR2,RIGHT(I] 

(+ move right , left, or none 1 


ISN, 

0PNTR2 

neighbor on? skip 

1 ’ 

, X 8 = 56 

BR, 

START 

off, get next cell 

1 1 


TBRB, 

2. 1, DONE 

8th neighbor? done 

.51 


-BR, 

INC 

NEIGHBORS 

COUNTER 

no, get next 
record survivor 

1 


ISZ 

PNTR, ST ART 

next cell? 


• 

# 


Total 

~6T 

"usee 

• 

-1. 

-1, -1. 

0, 0, 1, 

U 

1 

200, 

0, 200, -200, 200, -200, 

0 

-200 
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We can conservatively traverse the neighbors of each cell In a cleaned 
pattern blodc In 62 psec, so that the nwnber of such surviving cells In a 
pattern block could be counted (COUNTER) In 62y x lO.CKK) cells * .62 seconds, 
and thus In 620 seconds for all blocks. So, with a 1000-second budget, we can 
firKl 1(K)0 Interesting pattern-point excesses, clean those blocks, and ccxjnt 
the cells surviving an adjacency search In 662 seconds. Those excesses are 
then relayed In a high-level alarm to the Central C(xnputer. The operator. If 
he wishes, can view such pattern blocks on his Image processor, or can automa- 
tically let the most significant ones— I.e., with the m(st survivors— be 
automatically stored on the Special Interest Archive. 

The algorithm just presented Is good at finding blc^s, or dense amorphous 
clusters. It can be seen from careful study of Figure 3.16 that extended 
sparse contiguous structures may be revealed through repeated, more relaxed 
survivor criteria. If we allow the case of many survivors through relaxed 
adjacency requirements in one pass, a second pass requiring survivors to have 
surviving neighbors would then select such patterns. This would require the 
Inclusion of more mlcrc^rocessors, and would gain the ability to detect 
filimentary structures within the pattern blocks. 

In conclusion, to quote one member of the group, "signal detection and Iden- 
tification hinges on a good pattern recognition scheme". While the implementation 
here represents a somewhat basic technique, the foundation for easy computer pro- 
cessing has been provided, and, at this stage of the pipeline, any number of 
sophisticated software schemes could be brought to bear at little additional 
expense . 
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Concliclons Concerning Realtime Processing 

This completes discussion of the realtime processing system. All units 
requiring special hardware or softcfare have been fully laid out so as to 
demonstrate tiieir feasability. Opti(Mis have been given here and in A4.2 
regarding implementations of all or part of the proposed system. Problematical 
aspects 0 ^ the design have, in all cases (we believe), been explored and 
solved ravher than omitted or ignored. Those details that were omitted are of 
the banal sort, well within the domain of conventional digital design. 

4.3 - Archive System Implementation 

An essential part of the observing system consists of the ability to play 

5 

back an entire 10(X)-scan observation of a star during the observation of the 
next star or possibly at some later time as an archive. This requires two 
high-speed recording systens, double-buffered for simultaneous recording and 
play back, as well as archiving. 

The most inexpensive systems capable of handling the data rate (128 Mbits/ 
sec) and the storage requiren«nts (.1 terabits/star) were found to be hi^-speed 
helical tape drives. The optimal 1982 version of one of these drives has a 
storage capacity of 1 terabit. This is rather useful in that it eliminates the 
need for tape changes for every new star being observed. As many as 20 stars 
may be recorded before one observation must be rewritten over, or else the tape 
be changed. The decision to rewrite new data over old data is actually tantamount 

5 The processing of 1000 scans takes place during the taking of the same scans, 
and continues on during the taking of the next 1000 scans. It is assumed 
for simplicity in this study that each 1000-scans corresponds to the total 
observation of one star. Both shorter observations, and longer observations 
(lOM scans followed by a second observation) could be easily accommodated 
into the present scenario, in 20 second increments. 
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to the decision to erase data or to archive It. Since one tape drive 
stancte available for operator perusal during ead) new observation. Information 
from the alarms that were triggered during the realtime processing may be used 
to re-examine the raw data during the 16 minutes - 1000 seconds the tape drive 
Is available. At the end of this period, a new source Is observed and the 
ta(^ drives switch function. If the decision to save the data ra^er than 
write over It Is made, then the new data Is recorded further along the tape, 
leaving the data from the previous star Intact. After 10 stars are archived 
on one tape, the tape must be removed and a blank one spooled up. Commercial 
specs for these drives that will suit the need are mentioned In Appendix A4.1. 

The total observation archive stores Its data block by block (again, as 
a benefit of the array transposer). This enables the of^rator, when replaying 
blocks on this sequential access device, to rapidly view and store adjacent 
such related data, rather than waiting for the drive to seek and read out each 
row of a block separately. This Is of great advantage, for this sort of high 
bit read-out drive has long start and stop times (Appendix A4.1) and the 
(H^erator has only 16 minutes to decide whether an observation should be 
permanently archived or not. 

4.4 - Operator Interface 
Central Computer 

The overall operation of the Oasis signal detector Is controlled by the 
software operating syst«n of the central conputer. In this section we discuss 
Its major functions and specify the minicomputer configuration that carries 
them out. 
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Threshold Setting 

Each test in detector yields a value which is then compared against 
a threshold. The threshold is optimized for the most feasible balance between 
false alarms and missed signals. Threshold setting can be carried out in 
either of two modes. In the automatic mode, the CPU polls the response of the 
various tests while the receiver is observing a quiet patch of sky, or a 
white noise source. Under these conditions, the sensitivity of a particular 
test can be adjusted so that noise does not cause an undesirable number of 
false alarms. These levels can be adjusted regularly during the search while 
the antenna is "off target". Alternatively, the operator may desire to 
manually instruct the CPU to increment or decrement a particular threshold as 
desired. Each threshold is stored at a particular address and communicated 
to tests via an I/O port. 

The threshold setting function of the CPU provides a means of system 
self-checking and calibration. In the event a part of the system malfunctions, 
the CPU can be of utility. For example, if a particular test consistently 
triggers when only noise is present, the CPU will disable that test and 
notify the operator. 

Detection Handling 

After thresholds are exceeded and alarms (or "detection flags") are set, 
the signal is sent in the form of a priority "interrupt" and the CPU is charged 
with responding to those. The detector setting the interrupt sets an identi- 
fying bit in a CPU register and transfers other relevant data via an I/O inter- 
face. 

At this point the CPU consults its RFI catalog for possible information 
bearing on the detection. The catalog consists of two classifications of 
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signals. The first Is "natural" signals which have alreac^ be«i cataloged 
In various radio astrononiv observations, and previously -known RFI; frequency, 
amplitude, time of day and/or position bounds for these phenomena are 
recorded. Also, the local RFI receiver (^namlcally updates the catalog. 

Local RFI signals are saved for 25 seconds, long enough for tt»e rest of the 
system to utilize them. The maintenance of and referencing to tills catalog 
represent a considerable portion of the CPU's real time responsibilities. 

After a detection Is compared against the RFI catalog and Is found to 
not match any previously-known signals, nor have been concurrently received 
by the RFI antenna, the CPU performs one or more of the following; 

1. Alerts the operator (only If desired). 

2. Informs the display CPU to make the block available for audlo/visual 
scrutiny. This can also be done manually, by the operator. 

3. Prepares for the automatic storage of the block onto the special 
interest archive, when the total observation archive Is free. 

4. Records Information on the detection In the system log. 

5. Flags total observation tape record for saving. 

System Log 

The CPU maintains a system log, on standard magnetic tape, which 
includes Information defining parameters of tiie search In progress, system 
status, and all significant events and detections. 

Self- Diagnostics 

Each subsystem of the Oasis detector possesses a diagnostic processor 
capable of verifying correct operation by supplying special test data and 
examining the computed results. Diagnostics are performed during Intervals 
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between star searches on command by the central CPU. Each diagnostic 
processor notifies the central CPU of the result of its test; the aggregate 
of the tests is reported to the operator via a display of the system block 
diagram. 

The diagnostic processors are microprocessors with programs in ROM. In 
some cases, test data are also stored in ROM. In others, test data are 
generated by the diagnostic programs or by special hardware devices as, for 
example* with the array transposer diagnostic systems. 

The Oasis system oi^ration will in general be degraded but not precluded 
by the failure of one or more subsystems. The usual result will be to render 
one or more detection flags meaningless. The central CPU or operator can in- 
hibit the setting of such flags simply by setting thesholds high. 

Verification of the control CPU itself must of course be performed at a 
different time. Diagnostic routines for the CPU, its I/O devices, and the 
graphics system, are supplied by their respective manufacturers. 

Other Functions 

Other system service functions the CPU performs include: initializing, 

resetting, on/off control of the system. As part of the initialization the 
CPU can infonn interested algorithms of the length of the ensuing observation. 
These functions also have manual overrides. 

RFI Receiver and Catalog 

This subsystem consists of log spiral and vertical antenna, a preamplifier, 
and a remotely-progranmable spectrum analyzer interfaced with the Oasis control 
CPU. The spectrin analyzer is capable of locating spectral peaks or sweeping 
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the spectrum with a variety of resolutions and rates. It Is used by 
control CPU to maintain a dynamic catalog of probable RFI signals. A 
commercially- avail able analyzer with 1024 channels Is used for this purpose. 

Other RFI data can be entered Into the catalog from tape (via the system log 
unit) or manually by the operator. 

The catalog In Its complete form Is stored on the catalog disc. In 
addition to frequency, any available Information on amplitudes, times of day* 
antenna coordinates, etc., are Included. An abbreviated catalog of fre- 
quencies only (for quick checking) Is In the control CPU memory. 

Entries are static or dynamic. The former are entered via magnetic 
tape. Dynamic entries are si4)ject to review and removal by the RFI receiver. 

The central CPU consults the catalog In the course of processing detections, 
to determine validity. It has the capability of coin>ensat1 ng for any Doppler 
correction applied to the observatory local oscillator. It can also cause 
the spectrum analyzer to display for the operator portions of the spectrimi 
surrounding any specific detection. 

4.5 - Radio Astronomy System 

The Radio Astronomer Is given his own station. The output of the coarse 
or fine resolution stages of the MCSA are made available to this system so as 
to provide bandwidth for varying applications. Note that stokes parameters are 
computed as shown in Figures 4.30 and 4.31. The use of this Information Is 
discussed In Chapter 5. The observer also has a 256-Mbit RAM to enable 
quick sibtractlon and comparison of observations. Standard observatories 
(depending on location) may make up the post-processing backend shown In 
Figure 4.29. 


- 173 - 



- 174 - 






Figure 4. IK) - Prescalar ( is computed in the manner 
shmim in figure 4.15) 
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4,6 - Gross Attributes 


Storage Capacity; 2 Terabits Online 

At all times the Oasis System has a 2-terab1t online data store. This 
database contains the latest 20 seconds of observations (stored on disc)* as i 

well as any previous 4-1/2 hours of observations on two high speed high r 

density helical tape transports. Such transports are currently conmercl al ly 
available today (Ampex "Super HBR")* with each tape holding one terabit. 

Included with this tape drive, the Oasis system will support a tape library 

? 

where 20 tapes will store the observations of hundreds of stars, at .1 tera- 1 

bit per star (1000 seconds of observations). 

Since the Oasis system Is primarily a real time processor with the 
vast majority (95-99 %) of all Input data discarded for lack of Interesting 
features, it is expected that an entire stellar dsservatlon would be archived 
in the tape library only for exceptional reasons. Therefore the system 
supports a collection of smaller random access memories for Intermediate 
computation and processing. There are approximately ten Independent "working" 
memories in the system, each a solid state dynamic RAM, with total storage 
capability of 1.1 gigabits. 

In summary the system supports: 


2 terabits. Tape, 

Online 


40 terabits. Tape, 

Offline 

- 

7.8 gigabits. Disc, 

Onl ine 

- 

-1 gigabit, RAM, 

Dedicated 

* 


The role of these memories was delineated in the description of r^mory 

r 

■ stream preparation as well as In the discussion of the individual algorithms 

and archives. 


I 

I. 

I 


- 176 - 


Brief Inventory of Major Componente 

The following list Is Intended to give the reader a feel for the variety 
and scale of the hardware required to Implement the system. 

2 Super High Bit Rate (HBR) Tape Transports 
4 1,3 Gbit Parallel Transfer Disc Drives (remov^le heads) 

1 Hinicomputer/Dlsplay Processor Unit 

1 Minicomputer, referred to as "Central Computer" In this report 

7 Special Purple Hardware Processors- TTL at 10 MHz clock rates, 
varying In complexity froti 1 to lO's of Integrer multiplies 
and/or adds per cycle, with pipeline architecture 

1 Hardware Processor of considerable ccxnplexlty - customized 
Integrated circuits and a 1.3 Gbit Integrated circuit memory 

1 Vertically polarized antenna for RFI detection 

1 Log Spiral polarized antenna for RFI detection 

1 Conmerclal Spectrum Analyzer for RFI detection 

1 Standard Large Disk 120 Mbyte, 7.5 Mbyte/sec R/W rate 

1 Standard Tape Drive 10 Kbyte/sec R/W rate 

1 Raster Scan Video Monitor with 3-D Analog Control 

1 Microprocessor 

1 Gbit total of Dynamic RAM 

System *s Total Size 

The OSD will fit in one medium-sized room. Existing prototypes of the 
four large capacity discs are surprisingly small, measuring slightly less than 
a standard size disc. It Is estimated that the CWO w1 i i occupy an additional 
;wo racks, typically with one processor to a board, excluding the “Generalized 
"oherence by Rows," which will occupy several boards. Twc additional racks 
will hold the Central Computer CPU and the M1n1compute’"/D1splav processor. This 
'.s shown In Figure 4.31. 
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Main Dish 





Systw Cost 

tte list b€lw tlw cost of the Mjor conponents of the Oesis Signal 
Detector. 


2 Tape Drives at 250K 

$500 

4 Discs at 75K 

300 

2 Instant Replay Disa 

150 

1 Terminal 

200 

All other RAM 

120 

Central Computer 

10 


$1280 K 


Since a gemral rule of thumb for digital design Is that construction costs 
equal parts costs* the Oasis system would nominally be costed at tiiree million 
f^979) dollars. 
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Chapter 5 
Human Interface 


*Y«s» I hive a pair of ^es»" ropllad Sm» "and 
that's just It. If they mos a pair o' patent 
double minim magnifying gas microscopes of 
hextra pca^r» p'raps I might be able to see 
throu^ a fll^t o' stairs and a deal doon but 
bein' only ey^» ym see, my vision's limited." 

—Charles Didcens, Pldcwlck Papers 


5.1 “ Problem Statement 

The human observer performs a member of key functions In the detection 
system both In tiie capacity of "pattern (tetector/recognizer" and as a decision 
maker who can redirect, verl^, and si^lement the automated phases of the 
system. The human Interface system Is an Interactive link be^^een the human 
observer (operator) and the rest of the system. It Incorporates the observer 
In a variety of ways: as an alternative pattern (tetectlon algorithm, as 
me component of an algorithm which Is mainly automated, as a dec1s1on*maker 
reqmsting Information from the automated algorithms, and finally, as a 
trailed scientist familiar with the astronomical context of the SETI program. 

The in which Information Is displayed often determines Its actual 
usefulness. Effective decision-making about Interesting ETI signals cannot 
occur unless that Information Is presented In a manner commensurate with an 
observer's perceptual and Intellectual capabilities. The system uses an 
advanced Image processor and multiple visual display screens as the primary 
mcKles of data presentation. An auditory display provides supplementary 
Information. 


5 .2 - Using Human Information Proces^^ing Capabilities In SETI 

One all-too-common approach Is to automate every function that Is 
technically possible and economically feasible. Whatever Is left to do Is 
relegated to the system operators by default, kn alternative procedure. 



and the one being advocated here. Is to consider tf« hwman elemeni as an 
alternative design component* complete with all the tradeoff characteristics 

one would identify in selecting one piece of hardware over another. The ; i 

J j 

first question asked when considering the implementation of a search 

f] 

strategy or algorithm should not be how do we automate it, but, rather, 

should we automate it; if so, is total or semi -automation most appropriate? ^ 

I 

i I 

The tradeoff for accepting some degree of inconsistency and low 

precision in performance specifications is that the human being as infer- fl 

ii 

mation processor is far and away the best model on the market, if it can 
be determined how to effectively use him. In developing the observer's || 

role in the human- con^ju ter decision-making process, the following general 

I i 

attributes of human information processing are important. 

1. Human performance often exhibits what has been termed "graceful H 

. 1 

degradation" under less than optimal conditions. That is, performance 
falls off gradually as conditions worsen, and significant (although not ! 

optimal) discrimination performance can be obtained even under very poor 
conditions. This can be contrasted with the discrete "go-no" paradigms 
characterizing most automated algorithms. 

_ i 

2. Human beings are conditional samplers of information. That is, the 

rate and type of information sampling can be continuously modified based on i 

past experience and recent events. A human observer can frequently pick up 

: I 

clues to pattern phenomena by synthesizing events which, at the time they :! 

were perceived, were seemingly unrelated. Further, if he thinks he's "on -i 

to something", a human observer can then selectively attend to a narrowed 
range of signal stimuli, thus markedly enhancing sensitivity to those j 

signals on a temporary basis. 

3. People can process information on several levels. The •highest" 

level is conscious awareness. However, a tremendous amount of information i 


Is coded and stored that Is not necessarily accessible to conscious SMarentss; 
long-term storage Is operationally Infinite. 

Even though an Information display Is "too large", presented "too 
rapidly" or lacks meaningful ness at the time It Is viewed. It mey be 
processed on a subconscious level to the extent necessary for ttie observer 
to refer to It In subsequent decision-making. 

4. Certain attributes of a stlmlus can be discerned even In the 
absence of sufficient Inforoatlon to permit Identification. For exan4>1e, 
color can frequently be perceived before form; motion before luminance. 

Thus, a human can scan large quantities of Information very rapidly using 
certain stimulus dimensions (e.g., density) as a gross sort. 

5. The trained human observer can develop and operate under very 
cfxt|}lex (by computer standards) decision criteria. For exmnole. In a SETI 
(Aservation, amplitude of extreme levels Is obviously of Initial Interest 
regardless of other values (e.g., polarization). Below this extreme value, 
we might require additional parameters to attain given levels before the 
combination of information looks favorable enough to warrant detailed 
examination (e.g., type of polarization, consistency over time). Thus, an 
intricate system of multiple criteria which can be modified by experience 
can be developed by a skilled user. Although automated algorithms can 

be differentially weighted for various contingencies or combinations of 
variables, these weights must be derived prior to the search process and 
fixed arbitrarily at set values during the data analysis procedure. Humans 
can modify these weights "on line" under real time constraints. 

6. Human algorithms can be altered by selection and training 
without the extensive hardware modification associated with son« automated 
changes. It should also be noted that a hunan observer may be able to give 
Information about why a currently used algorithm Is not producing the desired 
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results; such qualitative Judgements can be useful In Interpreting more 
objective Indices of sjfstem performance. 

Straight lines and edges are very easily detected visually by humans 
On the other hand, curved lines and dashed lines are less easily detected. 

In the present SETI context, It would therefore seem that a human observer 
would be adept at detecting a linearly drifting signal as shown as a 
display of the time- frequency output of the multi -chanr^l spectrum analyzer 
(ICSA). On the other hand, a person may have more difficulty with pulses, 
and possibly greater problems detecting a wavy line embedded In a noise 
background. These predictions are Indeed verified In our preliminary 
experiments (Appendix A5.1). 

A more global characteristic of importance in human pattern detection 
is perceptual "symmetry." People note symnetry in visual patterns, prefer 
symmetry among components, and recall symmetrical patterns over time 
intervals more effectively than they do nonsymmetri cal patterns (Garner. 

1974; Chipman and Mendelson, 1979; Attneave, 1954; Szilagyi and Baird, 1977). 
Patterns symmetrical in time and/or space may be noted by the human observer 
over regions of the spectrum that are not compared by the automated 
algorithms described in Chaoter 3. For instance, the human observer may 
note a similarity among clusters based or the N-bit representation of data 
blocks (where one pattern block • 200 x 50 data blocks). The simplest form 
of symmetry, of course, is recognition of the same pattern occurring at 
different time intervals but within the same region of the MCSA output 
space (say for different stars). 

A further advantage of using the human observer as a (tetectlon device 
is his/her ability to deal with stimulus context. The letter A can be 
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written In a variety of type fonts, and yet the htman can usually perceive 
(rec(^n1ze) the characteristics on ^e pa|^ as m A. Machines have 
terrific difficulties with slmllwr contextual probl^, as attested to hy 
the care with which nunt>ers ar« stamped on bardc cl^clrs to permit automatic 
pattern recognition! In the SETI program, tee himan observer will be 
harder to fool than the automated algorithms udiose operations are more 
general than those of the "check protessor". Additionally, tee observer 
will be able to note the same pattern presented in a wide range of 
surrounding contexts. 

In summary, the hwnan as a detection algorithm will be effective 
at pulling out certain features of the fCSA cxitput and results of tee 
clustering scheme that will be missed by the automated algorithms. In 
this way, tee observer both extends and supp1en»nts the automated aspects 
of the system. Tte role of the observer as a detection device Is also 
advocated by Fender and Evans (1971) who review many of the advantages 
mentioned above, but In situations where the signal Is known beforehand. 

Perceptual Sensitivity 

It Is anticipated that signals from extra-terrestrial civilizations, 
having traveled great distances through an uncertain electromagnetic 
environment, are of weak Intensity as compared with background noise. The 
psychqshysical models that deal With this type of situation as It applies 
to the htanan observer are part of the "theory of signal detecttelllty" (TSD) 
(Baird and Noma, 1978; Green and Swetz, 1966; Swets, 1979). 

According to statistical decision theory one can choose an optimum 
threshold If the values of all ou'^comes and the probabilities of signal and 
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noise present are known. *n\1s threshold Is defined as 



where « probability of signal plus noise being present 

Pj^ ■ probability of noise alone being present 

N • no response, “noise'' 

Y ■ yes response, "signal" 

» value of a correct rejection 

= cost of a false alarm 
n 

Y “ value of a correct detection 
N ■ cost of a miss 

In the SET! context, P^ Is much larger than P^, thus suggesting that an 
observer would set a high B, and hence, would be disinclined to state a 
signal was ever present. But the second term of Equation (5-1) will tend 
to balance this bias. Looking first at the mmerator, the positive 
value of a correct rejection, is small relative to the negative 
consequences of a false alarm Y^. In the latter case the data must be 
archived, sorted and so forth— assuming Y^ Is negative the numerator will 
be a positive number depending on the cost of archiving false alarms. 

This drives B up and reduces false alarms. However, the denominator more 
than offsets this tendency. The negative value of a miss N^, though 
potentially large, merely represents our current level of Ignorance, and 
Is hopefully small relative to the value occurring from a hit Y^. The 
latter term Is obviously a subject for debate, but It Is probably at least 
as high In respect to the negative outcome of a false alarm as the 
prcbablllty of noise Is In respect to the negative outcome of a false alarm 
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as tha probability of noise Is In respect to the probability of a signal- 
otherwise » we would not be Initiating a seardt. Therefore we conclude 
that the far right hand term of Equatitm (4-1) Is approximately the Inverse 
of the middle tern; thus leading to a moderate 8 of artxind 1.0. In other 
worxte an (^server will not be fearful of suggesting tire presence of poten- 
tial signals, but will certainly be prevented from going overboard (claiming 
signals are everywhere) by the large value of built up through past 
experience. Although the foregoing analysis is not Intended to be rigorous, 

It does serve to highlight some of the very real factors an observer must 
cmsider In deciding whether or not a signal Is present on the display at any 
Instant In time. 

Threshold Setting and False Alarm Rate 

The preceding formula shows that the annoyance of false alarms Is but 
(me of four parameters that imist be considered when establishing the thresholds 
of the algorithm. If we take into account all four trade-offs, we can deter- 
mine the optimal threshold of the entire system and use this to set the thresh- 
olds of the Individual processor alarms. For example, if the formula Is used 
to determine that It Is optimal to have at least one alarm sound from the system 
with SOX probability every 80(K) seconds, on the average. To uniformly weight 
the Individual thresholds T^j, 1 > (1, N), J > (1, M), where N Is the nunber 
of tlnms an alarm may possibly sound In this period and M Is the Index of the 
particular alarm, we simply require that the probability that one will sound 
in 8000 seconds be P. P Is then given by (1-P) • .5, where we have assumed 

as an approximation that the alarms are Independent. 
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This Is the simplest arrangement. To weight different alarms, so 
that, for example, on the average the CMO alarm sounds more often than the 
PD alarm, we then require HN 

i," 

where P^j Is the weighted probability for the particular alarm. 

The operator then, with the aid of a software routine, will select the 
mean time between alarms he desires, the relative probability for each alarm, 
and whether he would like the alarm to cause the central computer to 
automatically archive the entire observation or whether he will be contacted 
by i.he computer to pass the final decision (more usually the case). 

Human Perception of Relative Magnitudes 

Evaluation by humans of the processed receiver output data relies on 
the observer's ability to perceive the relative magnitudes of visual 
attributes such as brightness, length, position, and color, and auditory 
attributes such as loudness and pitch. When humans are given a reference 
level of some perceived quantity and told to assign that level a numerical 
value (say 10), they can give numbers to represent the relative perceived 
magnitude of other levels of that comparison quantity In respect the 
the reference level. Assuming these numbers are values along a ratio scale 
of perceived magnitude, the relationship (mapping) between physical and 
perceived magnitude can be described by a power function: 

^ ( 5 - 2 ) 
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where t|>, the perceived magriltude (numericel rating) Is a power function 
of the physical Intensity (S). The exponent, y, depen<te critically on 
the type of quantity being perceived. The multiplicative constant, X, 
merely Is a scale factor relating units of measure. The Interesting para- 
meter Is the exponent. It Is approximately .3 fcxr both brightness and 
loudness, and closer to 1.0 for spatial position and length (Baird, 19^; 
Baird and Noma, 1978; Stev«is, 197S). In other words, people are relatively 
accurate In their estlmatlcm of spatial position and length, whereas they 
grossly underestimate the relative intensity of lights and sounds. The 
practical Importance of this result Is that one gains little by having a 
CRT with 2* levels of brightness In a cell (pixel) since tN human observer 
cannot resolve this many brightness levels. Similarly, we cannot anticipate 
high resolution of displays coded by auditory Intensity. 

IMIlke Quantitative attributes such as brightness, loudness and 
perceived length, sound frequency ar^ hue are translated Into Qualitative 
perception dimensions. "Green** Is not perceived to be twice "blue**. 
According to Stevens (1975), these attributes do not follow the power law, 
arx) hence, we expect to use them In current systan In cases were qualitative 
differences of low resolution are displayed. 

Another psychc^yslcal technique of relevance to our task is the 
method of absolute Identification (Baird and Noma, 1978). Here a series 
of stimuli are presented sequentially and $(d)jects must assign a imlque 
label to each. By Information theoretic measures, the channel capacity of 
the subject Is determined for attributes such as light, S(Hjnd and frequency. 
Surprisingly, stojects are only tole to absolutely Identify approximately 
6 or 7 different magnitudes (without error) for unidimmsional stimuli 
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(Hiller, 1956). As the dimensionality of the stimulus pattern Increases 
(say It varies In brightness, hue and site) the error-free level of 
Identification goes up. In the present context, observers In the SETI 
program will be required to recognize radio frequency Interference (RFI) 
at different points In time. We can expect such recognition capability 
to depend on the complexity of the Rri. The greater the number of dimen- 
sions one needs to define RFI, the greater the recall. For detection of 
signals themselves at low signal to noise ratios, we cannot expect the 
observer to be much help In recalling s1n^)le patterns over long time 
periods, or being able to discriminate a large number (certainly no greater 
than 10) noise displays. In brief, the human observer Is limited In 
ability to recall simple targets over exten<ted time periods, especially 
if Intervening noise Is present. 

5.3 - Display Parameters Affecting Observer Performance 

This section will discuss the major display variables associated with 
CRT's and their relaticmship to the human operator's search procedures. 

Hany real-world tasks have been Intensively studied (e.g., radar monitoring, 
piloting), and thus a considerable amount of Information Is known about the 
task dynamics. 

Lumi nance 

The luminance required for a CRT display is heavily dependent on 
contrast, airt)1ent environment and the nature of the visual task. Gould 
(1968) cites 24 ml as a minimum figure, and recommends 50 ml for most 
applications. 
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Brightness (B) Is the subjective IntHresslOn of limlnance (L). Uhder 
laboratory conditions the tMO are related by a power function (Equation (5-2)). 
where the exponent Is typically between .25 and .50 for snail* honogeneous 
fields (Narks and Stevens, 1966). This relationship does not hold for 
complex forms, however, particularly at higher luminance levels. The 
relationship appropriate for the SETI display will depend In large part on 
the scale of representation used (i.e. , what each dot corresponds to), and 
on the mental set adcx>ted by (or "trained Into") the observer (I.e., how 
large, relative to the entire display, must a signal be?). 

Another ccmsideratlon for the SETI operator is the duration of the 
observing periods. Relatively high luminance levels auy be optimized for 
short-term signal detections, but may also result In excessive glare and 
eye Irritation over time. 

In simmary, the CRT display will be made highly adjust^le In luminance 
and win be optimally reset for ead» graphical representation. 

Resolution 

Assimlng adequate luminance and contrast, a complex target Image should 
subtend no less than 12 min. of arc (Grether, 1972). For "detail" resolution 
(e.g., one picture element) no less than 1.5 to 2.0 min. of arc Is required 
(Bennett et al., 1967). 

Regeneration Rate and Viewing Distance 

When the regeneration rate Is too slow, the Image appears to flicker. 
Angular size, luminance and phosphor persistence can all affect the 
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regtMratlon rate *fhidi is necessary to obtain a "critical flicker 
frequency" or CPF (the point at which no flldcer is perceptible) for an 
observer (Kelly, 1965). Luninance and an^lar size alone can cause as 
nuch as 15 cps variation in the needed regeneration rate. When the 
individual d1fferen(»s in an observer's |)erceptual characteristics are 
ad(ted to the equatitw, it becomes obvicNis that there is a wide var1d)il1ty 
in the required regeneraticm rates for various applications. 

Current image processors may use marginal regeneration rates because 
of the reciprocal relationship between the number of binary picture 
elements, N, arel the regeneration rate, F, which is (in bits per secomfe, 

I (max) ) 

I(max) * F X N (5-3) 

Flicker can intact the observer's task in several ways. It can 
produce irritation and eye fatigue, particularly with the long viewing 
periods anticipated for SETI. It can also affect pattern perception 
indirectly by its influence over the viewer's distance frwa the screen. 
Thompson (1957) f(wnd that observers prefer viewing distances which are 
far enough from the screen such that small area flicker (e.g., that caused 
by the interlace between scan lines) is not resolvable. Thus, in 
addition to degrading pattern detection at relatively close distances to 
the screen, the presence of flicker may also determine the range of viewing 
distance adopted by the observer. 

For the SETI application, an image processor with significantly less 
flicker than is typically found on CRT displays is indicated. A great 
deal of research is currently In progress to develop flicker-free image 
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processors (with ade<^ate Informaticm display voliMes) for tt^ iHirpose of 
Medical dlagiKKls. Special versions of liiage pro^sors ttesIgMd for 
Medical applications are also Marketed currently, and shmild be seriously 
considered In SETI. Some Implications for human observers using the 
systems are gl^n In Picket and Triggs (1975). 

Operator Training 

The unique characteristics of the SETI app11cat1(Ki places cMisIder^le 
significance on the role of the system operator. Of particular Interest for 
system designers Is the potential long-term training effect. This could 
result In a specific observer becoming a much more effective signal 
detector over time. The SETI observer Is not a typical "monitor". In a 
typical monitoring task where an operator monitors a CRT screen, and may 
conduct simple dialogues with the computer, a typical training curve would 
tend to asymptote after a few weeks. Any residual 1mpn)veroent as a function 
of practice for such tasks is Insignificant for all practical purposes. 
However, in most monitoring tasks, "sufficient" or even c^timal performance 
can be clearly defined when training begins; that is, if 100 signals are 
presented, 100 detections would equal perfect performance vdiile perhaps 
98-99 would equal "sufficient" performance. With the aid of guidelines and 
Instructions, the learner goes through a series of successive approximations 
until criterion performance Is reached. 

For the SETI observer, no such feedback Is possible. By the basic 
assunqstlons underlying the program, an ETI has never been detected. This 
in^lies that no direct imasure of system performance (I.e., successful 
(tetectlon) will be possible for most cases. Therefore, the observers' 
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perceptions of their oum performance may vary widely, and perhaps In an 
Idiosyncratic manner. For example, oi« operator may decide to alter his 
search strategy because It has brought no results after 2 hours; another 
operator may follow a procedure for a significantly loiter period before 
modifying or discarding It. Oi^ operator may feel that one strategy Is 
more effective than another because It results In the selection of fewer 
total data regions for review, and thus is more sensitive; another operator 
may feel that this same approach results In too high a prob^illty of 
missing a weak signal. 

It should now be fairly clear that the training of SETI obserwrs will 
be far fron straightforward for the free mode of operatlrm. 

The training program Is directed toward the following goals: 

1. Observers are able to efficiently access and effectively use the 
display enhancement of the data. Data in various states of compression and 
analysis may be called up for viewing. Information from the different pro- 
cessors in the system will be assessed separately as well as interactively, such 
as might be required for a signal detection. Because each operator may put in- 
formation from various sources together differently in a complex signal detection 
process, it will not always be clear what aspects of the information display are 
being utilizedi and with what degree of effectiveness. By assessing the ability 
to use all information sources available, it can be determined if an observer 
can theoretically obtain the maximum information available for signal detection. 
Of course, this does not guarantee that all dimensions will be (or even should 
be) used in an observer's signal detection strategy. 

2. Operators are encouraged to try alternate strategies. An analysis 
of search strategy archival records may suggest desirable patterns of search, 
which could be denronstrated to other observers. 
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3. An ongoing evaluation Is conducted on veteran observers to deter> 
m1r« If continuing experience enhances the ability to discriminate unusual 
(rare) patterns from more typ1c*l noise fields. 

Summary and Corw:1us1ws 

The nature of human-computer Interaction In complex signal detection 
and recogniticm systans Is perhaps the least developed aspect of such systems. 
The unique and virtually unexplored context of the SET! progrmn makes 
research In this area a vital and potentially rewarding endeavor; not (wily 
for SETI, but for the areas of hunan-coBHauter Interaction, cognitive 
psychology and hwnan factors as well. 

The state-of-the-art In automatic pattern detection Is far less 
advanced than scientists predicted five or ten years ago. The human 
{^Jerator will eaitlnue to represent the celling for system performance 
for the foreseeable future. As such. It is critically Important that 
research be conducted which Is aimed at deriving guidelines for observer 
task design, and for display-environment specification. The NATO Research 
Study Group on Pattern Recognition (Hodge, et al., 1979) cited the failure 
to Integrate human factors consideraticms Into automatic pattern detection 
systems as the major failure of systems design In this area. 

5.4 - Human Factors Considerations 
Design Philosophy 

The hunan factors system analysis Is required to ensure that the task 
procedures, environnental condltlt^s and equipment designs are compatible 
with the psychological, f^yslologlcal and performance characteristics of 
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the Intended users. The tteslgn philosophy un^rlylng this analysis can be 
sunmarlzed as follows: 

1. Human performance for critical tasks RHiSt be, to ^ extent 
possible, designed Into the system at the Initial phases, not retrofitted 
after the fact. 

2. Since humans are highly variable, there are virtually no config- 
urations which are optimal for everyone. Systems can be d^igi^d to narrow 
the range of himan variation through selection and training; the residual 
variation must be considered appropriately In the system ctesign for human 
performance (the lower levels are subsmned under the advanced levels): 

A. Human Engineering: assuming adequate motivation, knowledge and 
skill, design the system to be operated in an optimal manner 
(easy, safe, reliable). 

B. Human factors: assianing adequate motivation, but not necessarily 
appropriate knowledge or skill, design tl% system to provide job 
aids and other support mechanisms to design compensate for human 
errors due to Ignorance, task overload, envl raimental stress or 
Improper decision-making. 

C. B^avloral Engineering; assuming notr.ing, design the system to 
predict the human failure modes Including all errors (deliberate 
or Intentional) due to neglect, sabotage or misuse, through one 
or more of the following three methods: 

I. Prevention: make the undesired behavior less probable 

II. Exclusion: make the undesired behavior Impossible 

III. Fall safe: assume that the undesired behavior will 
occur, and minimize the consequences to the system. 
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The level of design selected* and the method of ImplementatlcHi 1$ a 
function of criticality and system resources. Table 5.2 outlines the 
major areas of analysis. 

Again* the human factors function should be concerned with matching 
the tasks* procedures* equipment and envirorwiental conditions comprising 
the Display Subsysten to the expected characteristics and capabilities of 
the system users In an "q|)t1ma1'' manner. (For further discussion of 
methodology* see Van Cott arwl Kinkude* 1972; or Chapanis, 1970). In ^1s 
case, we assimie that the range of skill* training, aptitude* motivation and 
performance manifested by system operators will be significant. Because 
operators may be drawn from a limited source (e.g., radio astronomers)* ^e 
ability to modify these attributes through selection arwi training will be 
restricted. 

As such* the limitations and predicted behaviors of the human operator 
become the "givens" ar(Hind which the Operator Interface must be designed. 

The theoretical and expected capabilities of the operator serve as a 
celling on siA>system performance. From this ultimate value, the degree of 
degradation In operator performance caused by the operational conditions 
of the system (e.g.. Improper Illumination, noise, task overload) must be 
predicted. 

Stmwary of Some Key Human Factors Issues 

1. The operator's task must be designed In a manner which will maintain 
active participation In the search process but which will not produce 
excessive false alarms* and will not result In overload- Induced 
errors. This will require a detailed untters tending of the Intended 


1. Physical Design and Work-space Layout 

Ex.: Can the operator readh all controls and see all visual 

displays necessarily while performing sequential operations? 
Is the display-control ratio appropriate for the level of 
adjustment required on all image processors? 

2. Workload and Task Requirements 

Ex.: Is the workload sufficient to prevent boredom, fatigue and 
perceptual distortion but not so high as to create stress- 
induced errors? 

3. Task and Procedure Design 

Ex.: Is the operator receiving sufficient feedback to permit him 

to modify his performance (e.g., correct errors)? Is the 
appropriate sensory modality being used to display task- 
related Information? Can one tell If the operator Is 
performing as expected? 

4. Emergency and Maintenance Procedures 

5. Training and Selection Criteria 

Ex.: What level of training Is required for system operation? 

What training methods are appropriate? What Individual 
characteristics should be considered desirable (or undesir- 
able) for operator selecticm (e.g., spatial visualization 

ability)? 


I 

Table 5.2 - Major Categories of the Human Factors Functional Analysis 
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user groups. A description of relevant user draracterl sties s' ould 
be completed prior to the finalization of systen procedures. 

2. The operator should be acquiring knowledge arwl experience by performing 
his role In the system, I.e. , In addition to aiding the cmgoing search 
process, tte or^rator should be getting better at what he (toes over 
time (see "Training", In this chapter). 

3. Tradeoffs between optimal display settings (e.g., limlnance contrast) 
for "short-?term" and "long-term" momltorlng must be Investigated 
since the operator's perceptual system will tend to adapt (selectively 
desensitize) over time. 

4. The operator should have his performance monitored periodically to 
provide fee<t>ack and maintain attention allocation. This may Involve 
the generation of test signals to see If the of^rator detects them. 

Test signals should be well above detection threshold to prevent a 
casual response set from developing. 

5. Comm(Xfi failure modes must be anticipated and cteslgn compensated. For 
example, an observer who Is absorbed In an active search process may 
forget to permanently record and save the data. System design must 
anticipate this type of event, and handle It by one of the three 
generic approaches (prevention, exculslon, or fall safe), 

6. The role of system users Is variable In the free mode so that skilled 
operators can fully exploit the system's capabilities. However, If 
naive or low-skilled operators are employed for fixed-mode tasks, these 
procedures should not require excessive Interpolation or skilled 
Judgn^nts. 
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Table 5.3 outlines the ke:' display parameters whldi are relevant to 
the himian factors analysis while Table 5.4 summarizes the principal human 
factors considerations for the Interactive search system. (The sequence In 
which Items are found In Tables 5.3 and 5.4 does not reflect order of 
consideration or criticality). 


1. Viewing distance 

2. Angle of view 

3. Image size and variability 

4. Brightness and luninance 

5. Contrast (range and proportion) 

6. Visual field surroundli^ the display 

7. Ambient environment 

8. Resolution and Image st^lllty 

9. Duration of a display pattern and Interval length 

10. Scale of dots (size of mlnlimim Informational unit) 

11. Color 

12. Single vs. multiple observers 

13. Degree and type of visual cues (e.g., cursors) 

14. Coding methods (e.g., shape, flicker, brightness) 

15. Visual acuity required: level and type 

16. Density of visual noise 


Table 5.3 - Selected Visual Display Parameters Relevant to Human 

Factors Analysis 


5.5 - General Structure of the Image Processor 

The automatic signal detection algorithms alert the operator of a 
possible signal caused by RFI, random noise correlations or astronomical 
signal sources. This presents a massive data analysis and archiving 
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1. Dtsplay-Control compatibility 

2. Population stereotypes; transfer of training 

3. Rate of Information Input and requirmnents 

4. Task loading 

•S, Reaction to predicted env1ronn»ntal stress levels 

6. Training (skills required) 

7. Operator background/experience 

8. Speed vs. accuracy required (human reliability) 

9. Information coding (e.g., shape, color, pattern) 

10. Task design (e.g., sequency, coherence, dependency) 

11. Communication requirements 

12. Feedback and knowledge of results 

13. Motivation 

14. Work-space layout 

15. Anthropometrics/blomechanics 

16. Display mode selection and specification (for each task) 

17. Maintenance/trouble 

18. Design compensation for the disabled where possible 

19. Work-rest cycles; circadian-biological rhythms 

20. Emergency procedure/safety 

21. Aesthetics (attitudes toward the work context) 

22. Flexibility for future use 

23 Public relations and system documentatim (ability to demonstrate to 
naive persons) 

24. Operator/system performance assessment 

25. Behavioral engineering; resistance to 1ntent1(xial and accidental 
abuse 

Table 5.4 - Representative Hunan Factors C(xis1 derations 

i 

1 
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prc^lem and the operator's effectiveness In dealing Mith It depends upon 
the ability to quickly and efficiently view and manipulate subsections of 
the signal detector output and make correct decisions about the data 
presented. To further complicate the pr(^1em» the most Interesting 
signals will probably appear at a very low S/N ratio, so sophisticated and 
versatile Image enhancement techniques will be necessary to separate such 
a low-level signal from noise. 

A high-speed digital Image processor allows the operator to perform the 
various tasks associated with signal evaluation and system operation with 
high efficiency. In what follows, we describe the general characteristics 
required of a SETI Image processor. 

In response to a signal alarm, the operator requests that the subsection 
of signal detection output containing the signal be sent to the Image pro- 
cessor's refresh memory. The refresh memory In a minimal system Is two 
frames of static or dynamic random access memory (RAH), each providing 
three channels of 512 x 512 spatial resolution In 8 bit pixels (picture 
elements), and two 512 x 512 x 1 bit graphics overlay planes. This system 
allows for future expansion In resolution, number of frames and number of 
channels. As the need arises and as memory becomes cheaper. It will be 
possible to add more displays and/or enable the operator to scroll through 
a larger data base with fewer transfers from the host computer. 

The graphics overlay planes store system status Information for display 
on a black/white (B/W) status monitor to provide overlays such as graphs of 
statistical Information or to designate subareas of the Image for furtiwr 
processing. 

A track ball or a digitizing tablet and stylus Is used to position a 
cursor on the display. A trackball Is a captive sphere that can be rolled 
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In any dirtctlon causing a corraspondlng iwvwient of tht cursor. When 
used In conjunction w1^ function switches « Individual pixels can be 
Identified or an area can be outlined. The digitizing tablet and stylus 
can be used In a similar manner while additionally providing iMMdIate 
hard copy on paper with an Inking stylus. Scrolling through a zoomed Image 
or an Image larger than 512 x 512 Is done with a trackball or joystick. The 
operator Is able to scroll quickly by rows or colunms through the entire 
refresh memory. Zoom Is accomplished by posltlonlf^ the cursor and 
specifying the zoom factor. Eng1arg«nent of at least 4:1 Is done by pixel 
replication without altering refresh nwnxrry concents. 

When data from both polarizations or two algorltNns are to be viewed 
slimiltaneously, a split-screen view Is used. This Is accomplished by 
selective lolling of refresh memory by the host computer or by reading 
subsets of two d1ff«^ent refresh monorles. A feedback loop allows the 
processed Image to be written back Into refresh memory fcr Iterative 
c^eratlons and spatial convolutions. The operator can perforin sophisticated 
Image enhancement functions In evaluation potential signals without Involving 
the host computer. 

A built-in microprocessor handles all Internal control functions as well 
as some processing operations, thus off-loading the host con^ter of most 
Image-oriented tasks. Microprocessor programs will be down-loaded from a 
library on the host computer. The microprocessor performs diagnostics and 
aids In locating problems within the Image processing system. 
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Figure 5.1 
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Image Processor Conceptual Diagram 
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Image Processor 






















S.6 - S^stwi Overvtfw 

The displeys $id>system must perform four types of functions: system 
stati;m, (grating Information, alarms, and detection. 

System status Is the operator's bookkeeper. It keeps track of all 
Information concerning system faults and operating parameters (target star, 
gain, bandeldth, etc.). Further. It provides for Input regarding these 
parameters by the operator. Operating Information Includes a library of 
procedures and catalogues readily accessible from airplane schedules to 
How to Replace a 16K RAN Board (In three easy steps)". Alarms signal the 
human that the machine requires some attention, vdtether It be to check 
a possible ETI detection, to see why the NCSA malfunctioned, or to perform 
maintenance operations (e.g.. put more paper In the line printer). 

Detection Is really what the entire system, and Indeed SETI Itself. 

Is all about. This Includes everything the operator uses to either search 
fair ETI signals or to cull out RFI from automatically "detected" signals. 
The display parameters are readily changeable by the operator. Complex 
analyses, such as 2>D Fourier or Walsh transforms, that are too expenslw, 
time consimilng. or experimental to be done on the entire output can be 
done quite easily on a 512^ display matrix. Such support systems could aid 
the operator significantly as the SETI program advances Into more sophisti- 
cated analyses of the radio spectrum. 

There are two major modes of operator observation, "fixed" and "free". 
By fixed mode we mean the operator responds to the results of the automated 
algorithms and system stctus test. In this nwde he/she monitors equipment 
for proper functioning, and when a detection occurs (alarm from automated 
algorltoms) It Is analyzed, and appropriate action Is Initiated. In the 
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"free mode" the operator acts with greater freedom concerning search 
strategies. For example* one mey monitor a selected region of the NCSA 
output displayed on the color CRT, listen to a selected region of the 
spectrum, or call up statistical analyses associated with specific tests 
in the N-bIt number. 

Fixed Node 

Although the flexibility of the human observer in formulating, modifying, 
and executing search strategies is a major resource for SETI, less flexibility 
is required for the fixed mode. Here the operator performs a relatively 
fixed set of procedures, especially In response to an alarm. Failure to 
attain high levels of standardized performance would result in a non-systematl c 
data base which would hinder the cumulative and longitudinal nature of the 
SETI program. There are two phases of the fixed mode: ntxrmal and 
examination. 

Normal State - Between alarms the operator is provided with both visual 
and auditory displays that give a selected sample of all incoming signals, 
as well as status information on the observations and equifxnent operation. 

The color CRT displays a subsection of the NCSA output spectnmi. Each 
spectrum is displayed as a horizontal line, with the intensity of a pixel 
proportional to the average signal power at that frequency. Each second, a 
new spectrum is added beneath the existing ones. In this way up to 512 
seconds of output mqy be observed simultaneously. Since there will be two 
NCSA spectra (one for each polarization), the display will alternate between 
them at the discretion of the operator. 

The auditory portion of the system serves as a final step In checking 
a possible signal of Interest which has been flagged by an algorithm or by 
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the operator. A search of the entire 8 NHz bami each second Is not under- 
taken due to the complete Ineblllty of ^e operator to distinguish differences 
at such a rapid rate. Since an alarm will not yield Infontmitlon as to which 
NCSA data caused the flag, the computer-controlled tundble system checks the 
real part of the Input for each NCSA. 

The auditory system taps Into the data flow at an Intermediate frequoficy 
(IF) before the fCSA. Using an appropriate local oscillator, mixer, and 
26-34 MHz bandpass filter as shown In Figure S.4 , the IF signal Is hetero- 
dyned down before being received and converted to the auditory ran^ of 0-8 KHz. 
As alarms are Sounded for possible ETI's, the computer controlled system 
places the Indicated algorithm frequency In the middle of the audio band. 

For algorithms using 40 Hz by 20 sec. blocks, the center block frequency Is 
the one used. 

Based on cost, the pre*MCSA data are not stored. Since the auditory 
system Is In real time, the operator hears regitms which are down stream 
"n" seconds from when the alarm was triggered. Therefwe, the signal may 
have drifted off the alarm frequent^ by the time the operator liears" it. 

The number of seconds "n" may vary from a few seconds to 1000 seconds. For 
the worst case of n ■ 1000 seconds, and a drift rate of 1 Hz/sec., the 
signal would drift only 1 KHz and would still lie within the 8 KHz band. 

To check for irore rapidly drifting signals, and to allow for error, the 
receiver tunes to the indicated frequency and to adjacent 8 KHz bands. The 
auditory display correlates real time with the past, and yields a check for 
the continuing presence of the detected signal. In alarms which originate 
from the cluster analysis (n ■ 1000), the operator may elect to hit the 
ov^erride button and hold the antenna In tte location of Interest. 
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Auditory System 
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A second black and Mhite CAT continuously displays a varied of 
system*status data. Some of this Information Is shown only upon reqiMSt. 
This Information Is also recorded In permanent archives whenever an alarm 
Is signaled by either the operator or the automated algorlthna. Typical 
status parameters are: 

1. Observing Parameters 

a. Telescope pointing direction 

b. Target star name* distance* type* etc. 

c. Sky map of the area around the antenna beam (about 10 beam- 
widths) Including all known astronomical obj^ts* at all 
wavelengths 

d. Date* time* polarization 

e. Operator's name 

2. Receiver Parameters 

a. System gain and bandpass shape 

b. telf-dlagnosis Information. Tests Rmst be end-to-end. 

c. MCS' Input and output bandwidths. 

d. Status of any selectable option 

3. RFI Parameters 

a. Radio and TV catalog and schedules 

b. Earth satellite catalog and schedules 

c. Airline schedules and flight paths 

4. Environmental Parameters 

a. Mind* precipitation and lightning Indicators 

b. Outdoor TV camera, steerable and zoomable* to allow watching 
the antenna and Its surroundings for unusual conditions that 
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could cause false alarms or malfunctions (snow drifts* 
flocks of birds or Insects* trespassers* etc.) 

Examination State - When an alarm occurs from the automated algorithms 
the operator carries out a predetermined set of procedures* somewhat different 
for different types of alarm. For exany)1e* the drifting carrier detector 
(CWD) triggers an alarm to the operator whenever the Integrated coherent 
power along a drift line exceeds the present threshold. The operator first 
requests a graphical output showing the location of all data In the vicinity 
of that which caused the alarm. At this point, and at each succeeding one 
in the decision process, the operator may enter a coded value at the CRT 
controls regarding the nature of the alarm. There are four possibilities: 
noise (N), radio frequency Interference (RFI), useful astrof^yslcal Infor- 
mation (AI), potential signal from extraterrestrial Intelligence (ETI). 
Selection of option 3 or 4 saves the region of Interest In a permanent 
record and automatically records all current status parameters. (H>t1on 1 or 
2 terminates consideration of the alarm. If the ETI option Is selected* 
the operator must decide whether or not the telescope should be kept cm 
the star beyond the 1000 seconds normally allotted. 

Before an option Is chosen the operator may request further data 
concerning the alarm. Specifically, through Instant replay the original 
NCSA output can be brought onto the color CRT for imxre detailed examination. 

In this case, the output Is coded In a frequency x time matrix* with 
amplitude represented by pixel brightness. By the use of split screen 
techniques* both NCSA outputs of the same region are shown. Through the use 
of the CRT, the operator can zoom Into a subregion* produce a histogram of 
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the Intensities, and have the appropriate vectors color-coded and displayed. 
TIm purpose of all such tests Is sl^ly to speed and aid the final (teclslon- 
maklr^ process with respect tc the four options (N, RFI, AI, ETI). This 
selection can be made at any time durlr^ a search with the consequent 
procedures outlined abpve. 

The operator also receives auditory 1nf(»*mat1cm about the current 
state of affairs In the region of Interest. As so(m as an alarm Is signalled 
to the (H>erator, the communications receiver driving the auditory display 
Is automatically tuned to the signal frequency (origin of the vector) and 
Its bancMIdth adjusted to encompass a 8 KHz region centered on the origin. 
Since the auditory pickup Is In real time, the operator is able to ascertain 
rdrether the signal Is still present In the region of the spectrum delimited 
by the instant visual replay. This sort of data will be helpful In 
deciding whether further observing t1n» should be devoted to that star region. 
In addition, the <H>erator enters any special comments about tte circumstances 
of the alarm via the hard-copy terminal. 

A second type of alarm Is one triggered by the clustering algorithm 
based grouping similar N-bIt patterns. Uhm the operator receives such 
an alarm, the Instant replay tape Is searched for the pattern block of 
Interest, as well as the two adjacent pattern blocks (adjacent In the 
frequency spectrum). A pattern block Is a 200 x 50 matrix of data boxes. 

Each data box Is a 40 Hz x 20 second region of the fCSA output. The pattern 
block gr(Mips 200 (overlapping by 8 Hz) frequency blocks of 40 Hz each by 
50 tliM blocks of 20 seconds each. 

The Inital display of these data Is of the critical pattern block In 
the center of the color CRT flanked and surrounded by the eight adjacent 
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pattern blocks (appropriately translated to prevent display of redundant 
data, and clipped on the left and right sides to fit on 512 x 512 screen). 

Since a pattern block Is based on data from both MCSA's, only a single 
(Hitput Is shown on the screen. The location of the N*b 1 t numbers of 
Interest are shown as Individual pixels of a constant brightness against a 
dark background. Details of the statistical tests comprising the N>b 1 t 
number are printed onto the hard-copy terminal. An example of this display 
in tabular form is given in Table 5 . 1 . At this Junction, the operator proceeds 
as previously described: decides which category Is appropriate to define the 
alarm, or request further Information. 



Tests Status 

Significance 

1. 

complex coherence 

1 

>ai 

2. 

broadband pulse 

1 

>02 

3 . 

degree of polarization 

1 

>03 

4 . 

goodness-of-fit 

1. 

.01 

5 . 

ANOVA rows 

0 

.20 

6. 

ANOVA columns 

1 

.001 

7 . 

ANOVA Interaction 

0 

.20 

8. 

total power 

1. 

>«4 

9 . 

8-Hz pulse 

0 



• threshold parameters to be given numerical values (for different levels) 
Table 5 . i . Sample Output to Represent Results of N-b 1 t Pattern 

If * decision cannot be reached Imnedlately, the operator requests that 
closely related bit patterns be displayed simultaneously within the pattern 
blocks. In this case, the alternative bit patterns are coded with different 
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colors. Finally, whilt still at the lewl of tha pattarn block, tha crltaHon 
or statistical laval at which an Individual tast of tha N-bIt pattarn was 
passad Is displayad as a brightnass valua (highar brightnass raprasants 
highar significance). 

If more detailed results are still necessary before a decision can be 
reached with any confidence, the operator specifies the relevant region of 
space within the pattern block by cursor control, and the original data 
boxes (40 X 20) In that region are displayed. By split screen techniques, 
both NCSA outputs are shown side by side. Simultaneously, the auditory 
display 1$ automatically turned on and moved to the same region of the 
current frequency spectrum. (This may be turned off, lowered In volume, 
and so forth, at the discretion of the operator). From this vantage point, 
the zoom, histogrmn and col<xr coding options of the CRT are used to highlight 
potential signals and eventually aid In the operator's decision concerning 
the underlying cause of the alarm. 

Free Mode 

In addition to responsibilities associated with the fixed-mode displays, 
the operator carries out analyses of possible signals In a free mode. Indeed, 
most of the time an observer will be viewing or listening to a noise field 
that does not contain a signal. In the free mode, the observer may think a 
signal Is present, based on the existence of an unusual pattern perceived 
on occasion within the display. In the SETI situation, such "thoughts" may 
In fact reflect bona fide signals. 

The real advantage of the free mode comes through the flexibility of the 
human being as a pattern detector. The Inherent limitation of the automated 


- 215 - 



algorltNns 1s that features of potential patterns must be specified 
beforehand. The operator works under no such constraints. He Is given 
virtual freedom to explore any means which will lead him to make a 
decision. These Include 

3 archives 

- E'**1re Observation Archive 

- Special Interest Archive of previous observations 

- Pattern Block Display 
RFI detection system memory 

Radio Astronomy System, Radio Astronomer 
Software routines emulating system algorithms 
High level software routines of his own design 
His own memory, Insight and Intuition 

In conclusion, the free mode allows the detection system to operate at 
its best, combining the broad and diligent scrutiny of the machine, with the 
wisdom, judgment, and inventiveness of the human. 
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ConpaHson of the Fixed and Free Modes of Operation 


Characteristic 

1. Nature of Interaction 

2. Information Needed 


3. Operator Task Load 

4. Operator Task Sequence 

5. Operator Task Mode 


Fixed Mode 


- Directive 

- Evaluative 

- Seard) Process Stei» 

- Feedback; Errors 

- Time Elapsed 

- Time Remaining 


• Low 

- Fixed 

- Forced Paced 


Free Mode 


- Informative 

- Responsive 

- Search Options 

- Archival Records 

- Current Search 

Record 
-Time Elapsed 

- Time Remaining 

- Low to High 

- Variable 

- Self-Paced 
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Chapter 6 

Radio Astronory Applications - Molecular Spectroscopy 


Some thills you miss bocause ^ey'ra so tiny you 
ovtrlook thfffl. But some things you don't see 
because they're so huge. We were both looking at 
the sune thing, seeing the same thing, talking 
about the same thing, thinking about the same 
thing, except he looking, seeing, talking 
and thinking from a completely different dimension. 

—Robert Pirsig. Zen and the Art of Motorcycle 

Maintenance 


of the time of this writing, tiiere are nearly 50 molecular species. 
Icxfts. and radicals that have been found In tiie Interstellar medium by radio 
astronomers. Observations of the lines from these molecules provide Impor- 
tant istrt^hyslcal Information ^out the sources In which they are fouml. 

As well ^ being discovered In gas and dust clcnids In the Interstellar 
medlun. molecules have also been observed In circunstellar shells. There Is 
a constant recycling of matter between stars and Interstellar space, with 
new stars being bom from the gas and dust and old and dying stars releasing 
matter back Into the Interstellar medium, molecules have also been observed 
in circumstellar shells. There Is a constant recycling of matter between 
stars and InterstelU.' space, with new stars being born from the gas and 
dust and old and dying stars releasing matter back Into the Interstellar 
medium. Currently we estimate that about 90% of the mass Is In stars and 
the rest In gas and dust that occupies the volum between the stars. 

From many of the compounds detected, radio astronomers are able to 
observe several different transitions or lines. From the Intensities and 
widths, physical conditions such as temperature and density may be probed. 
Calculations can be made to estimate the colimin density of a specific mole- 
cule and hence provide relative Sundance Information. Abundances are 
better determirrad when several lines are available from different energy 
levels so that the population statistics In various energy levels can be 
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e$t1n»ted. The presence of a molecule together with its abundance provides 
fundamental data for Interstellar chemistry, that Is the reactions and rates 
responsible for production and (testructlon of the molecules. 

For each molecule present there Is also Information that Is automatically 
available for velocity along the line of sight to the observed region from 
observations of the Doppler shift, A grid of observations In a molecular 
line then provides us a view of the kinematics of the region. In addition 
to the nearly 50 different molecular species found to date, which so far 
are composed only of atoms of H, C, N, 0, Si, and S, there are also 
Innumerable Isotopes discovered. These add to the list of atoms the 
following: 0, “C, *®N, ”0, *®0, **S1, *®S1, ”S, and ®'*S. By observing 
line ratios of different isotopic species It Is possible to obtain Isotopic 
abundances when the cloud is optically thin. In turn, the nucleosynthetic 
history of our galaxy Is highly dependent on Isotope ratios such as **C/ 
and *’0/ (see for example Audouze, J., and Tinsley, B.M., 1976 Annual 
Review Astronomy and Astrophysics). 

The characteristics of the MCSA/SD make It uniquely useful for some radio 
astronomy applications. In other Instances where It is not necessarily unique 
It will be a useful back end for line work. Since we feel the primary use 
of the instrument will be In conjunction with SETI in the water hole, we 
emphasize work that ccHild be done In this frequency range. However, the 
MCSA/SD has much more general capability so we also discuss projects of 
Interest that are not restricted to the water hole. In addition the MCSA 
being built by JPL Is different from that being built for NASA- Ames, and 
different problems lend themselves better to one or the other. 

Table 6.1 contains a list of molecular lines In the water hole observed 
in one or more astronomical sources. We list In Table 6.2 other lines In the 


water hole that have not yet been detected astronomically but are of astro- 
physical Interest since other lines from the species or an Isotope have 

already been observed. The ultra high resolution capability will in most 
irstances not be necessary for work on these lines. Thermal widths in 
typical cold clouds are larger than about 1 KHz white the actual widths in 
even the narrowest lines will be larger because of turbulent broadening. 

In a search for new lines the JPL NCSA will be extremely powerful. With a 

total bandpass of 300 NHz and 10‘ channels, the spectral resolution will 

be %0 Hz. This provides sufficient resoluticm while cwering a bandpass 
as large as tl^ waterhole in one setting. In this regard an instantaneous 

bandpass of 500 fWz vrould be even more desirable. The detection of new 

molecules will provide additional pieces in the puzzle of the young science 
of astro-chemistry. Radio astronomers are on the verge of detecting amino 
acids in space. Glycine, the simplest amino acid, has 10 atoms and a mole- 
cualr weight of 75. Molecules as complicated as 11 atoms and a molecular 
weight of 126 have already been found by radio astronoiny spectroscopy. In 
addition, all of the chemical bondings that are present in glycine have 
been seen in at least one of the molecules found already. 

Current OH surveys in our galaxy have been limited to ±2’’ of the galactic 
plane and have been unable to cover high velocities because of limited band- 
width, With the JPL NCSA all four OH lines could be covered simultaneously 
(as well as lines of the isotopes) perhaps making it worthwhile to undertake 
a major new OH-survey, that would also pick up the high velocity OH. As 
lc>ng as ¥« know ti» positiwis of OH to within 1' - 2' very useful polariza- 
tion work could be done on the OH masers. These new observations could 
provide the necessary resurgence for theoretical wwk on classification 
schenes and pumping mechanisms. 
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An Interesting experiment with the NASA-Ames 1CSA will be to atten^t 
to find Zeeman splitting In OH. Crutcher and Helles (private communication) 
feel they have a possible first detection of this for OH from a dark cloud 
In Taurus seen In absorption against the source 3C133. Thoy observed both 
right hand circular (RHC) and left hand circular (LHC) polarization with 
the Arecibo telescope. With two ^KISA's providing RHC and LHC polarization, 
we would subtract one spectrun from the other and look for the slightly 
shifted lines which would manifest themselves by a sideways S profile. The 
height of the ripple is proportional to the magnetic field, and the frequency 
difference between maximum and minimum is equal to the full width at half 
maximimi, FWHM, of the line. The narrowest lines found so far in 18-cm OH 
are about 0.5 Km/s or 3 KHz wide. This is the case for the dust cloud L134N. 

This is considerably greater than the thermal width which for a lOK cloud is 
about 0.1 Km/s. Chances of successful detection will be enhanced when 
clouds are discovered that are both colder and less turbulent or have a larger 
magnetic field. The Zeeman splitting is 3.27 Hz/y6 for the 1665 MHz line and 
1.96 Hz/pG for the 1667 MHz line (see e.q., B.E. Turner and G. Vershuur ApJ 162 , 
341, 1970). Zeeman splitting in the 21-cm HI line has been measured in a 
nunt>er of sources since the first detections by Vershuur (ApJ 156, 861, 1969 
and Nature 223 . 140, 1969). The splitting Is 2.8 Hz/pG. Further work could 
profitably be done on HI as well as on other molecules with an unpaired 
electron spin such as CN, CH, and SO that would have comparable Zeeman splitting. 

The spectral resolution provided by the MCSA/SD will easily resolve 
nuclear quadrapole hyperfine splitting. This has already been done for forma- 
mide but not yet for * formamide (see Table 6.1). The latter would require 
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a more sensitive system than avall^le to the French group that measured 
one conponmt of NH^^’OOH. Even Mhmn ttm rotaticmal Mergy levels are not 
|KK>u1ated according to local thermtKl^nMlc equilibrium LTE as Is the case In 
formmnlde, the hyperfine comporante ap(»ar to have relative Intensities 
ccmformlng to LTE. This would be Interesting to c(»rroborate and further 
check In other molecules. Die NCSA/SD system would be particularly useful 
for more closely spaced hyperfine lines such as the spin-rotation lines of 

the 1 -►I transition at H CO at 6-cm. The F„ -► F, lines 0 -► 1 and 

10 11 z u L 

2 2 are separated by about 1 KHz. Astrcmomlcally, they appear to be In 

LTE. 

As mentioned above the nucleosynthetic history of our galaiqr depends heavily 
on ratios of line intensities of varloic Isoti^lc sf^cles. According to 
A. A. Penzias (lAU Sympi^lum No. 87 Interstellar Molecules meeting 8/1979 to 
be pidillshed) the Interstellar abundances ratio Is ^out 150X of the 

terrestrial value of 1/89 throughput the plane of our galai^ and about twice 
the terrestrial value near the galactic center. “C Is a primary element 
formed directly from H and He Inside a star while Is considered a secondary 
element produced from primary elements In secorwl and later generation stars 
by the S-process. Table 6.1 and 6.2 show that Isotope experiments could be 
done with lines In the water hole. 

The NCSA coupled with the Signal Detector would provide a valuable tool 
for radio frequency pulsar Investigations. This apparatus makes possible 
the high resolution power spectrum monitoring of pulsar pulses, as well as 
a frequency domain record of Die type and degree of polarization (stokes 
parmneters) associated with a single pulsed period. The following experiments 
are suggested ai worthwhile: 
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1. Measuring the bandwidth of micropulses and nano- 
pulses within a single pulse. 

2. Using Power spectrum, or autocorrelation functions 
derived from the Power spectnw, to determine the 
relative contributions of sub-pulses, micropulses, 
and nanopulses to the total Integrated single pulse. 

3. Exploring the possibility of extraterrestrial 
transmitters being located near a pulsar, using 
the pulsar as the signpost advertising a beacon 
signal. 


Radio Recombination Lines 

Since radio recombination lines are very wide there Is no need to observe 
them with the MCSA/SD having 8 MHz bandpass and 8 x 10® channels. Or> the other 
hand an instantaneous bandpass of about 300 MHz with 10® channels would provide 
a unique combination for recombination line experiments. For example, the 
entire water hole could be covered with a single bandpass that would 
provide simultaneous observations of the following hydrogen, helium, and 
cart)on recombination lines: 


a - 

lines 

156 - 

166 

B - 

lines 

196 - 

209 

Y - 

lines 

224 - 

238 

a - 

lines 

247 - 

282 

e - 

lines 

265 - 

282 


Rest frequencies for these lines can be found In (A.E. Lllley and P. Palmer 
ApJ Supplement 144, 16:143, 1968). Such observations would provide better 
data in the sense of good Internal calibration because of the simultaneity of 
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the observations. A new level of accuracy may then be brought to bear on 
such familiar problems In gaseous nebulae as: 

1) local thermodlynamic equilibrium (LTE) and departures therefrom. 

2) electron temperatures 

3) line broadening - Stark effect 

4) gas dynamics - within an Individual object as well as an aid In 
galactic structure determination. 

5} relative abundance of helltan to hydrogen with Its cosmological 
significance. 

6) carbon line studies to determine regions of emissions compared to 
hydrogen and helium, and relative abundances. 

For additional commentary on the above avenues of research see for example 

R.L. Brown, R.J. Lockman, and G.R. Knapp, Ann. Rev. Astron. Astro^^ys 16. 445, 1978. 


- 227 - 



Lines In WaUr Hole 


1.42 « 1.72 6Hz A1rea(<y Detected Astronomically 


1 

» (GH 2 ) 

Molecule 

1.538 

135 

NHgCHO 

1.536 

693 

Formanide 

1.539 

295 

Formamlde 

1.539 

570 

Fomaml de 

1.539 

851 

Formamlde 

1.541 

018 

Formami de 

1.570 

825 

NHg^^CHO 

1.610 

249 

HCOOCH 3 



Methyl Formate 

1.610 

906 

Methyl Formate 

1.612 

2310 

OH 

1.624 

518 


1.626 

161 

”0H 

1.637 

564 

>«0H 

1.638 

805 

HCOOH 

1.639 

503 


1.665 

4008 

OH 

1.667 

3590 

OH 

1.720 

5300 

OH 


Transition Reference 

Ho-Hi 

1-2 

2-1 

1-0 

2-2 

0-1 


ho-' 

11 

2-2 8 

ho"' 

11 ^ 

2.3 

ho-' 

11 ' 

2.3 

’’ 3/2 

J»3/2 

F»l-2 11,13 

^’’ 3/2 

J»3/2 

F.Fj»7/2.4-7/2,4 1.5 

2 ^ 

" 3/2 

J»3/2 

F.Fj«9/2. 4-972-4 1.5 

2„ 

^3/2 

J*3/2 

F*l-1 4,6,10,15 

' 10 -' 

11 

17 

2_ 

^3/2 

J*3/2 

F«2-2 4,6,10,12,15 

2„ 

J«3/2 

F=l-1 11,13,14 

2, 

^3/2 

J*3/2 

F*2-2 11,13,14 

2, 

^3/2 

J«3/2 

F-2-1 11,13,14 


Table 6.1 
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Lines In Wnter Hole of Potential Astroohyslcal Interest (Hot Yit found) 


V (GHz) 


Molecule 

Transition 


Refereno 

1.455 

7290 

* 

I'oi 

^2^2 J*3/2 

F,Fj 

•5/2, 2-3/2. 1 

1,12 

1.474 

787 


HgCS 

^°2,8‘^°2,9 



1.7 

1.501 

250 


NH^CHO 

194.15-1^.16 



1.7 

(unsplit - 

will 

have hyperfine components) 




1.547 

377 


HgCO 

1®4.14-1®4.15 



1.7 

1.549 

798 


HgCS 

^^3.2r^S,22 



1.7 

1.584 

2740 


^®0H 

2„3/2 >3/2 

F»l- 

2 

1.12 

1.596 

061 


HDS 

^3.0" ^3.1 



9 

1.604 

270 


HNCO 

^1 , 3 -^ 1 ,4 


10.16 

(unsplit - 

will 

have hyperfine components) 




1.629 

515 


NH 2 CHO 

1 ^ 3 . 9 - 1 ^ 3.10 



1.7 

(unsplit - 

will 

have hyperfine components) 




1.636 

816 


HDS 

94.1-94.2 



9 

1.655 

499 



%/2 0*3/2 1 
1 


5/2 .3-5/2 .3 

12.1 

1.656 

542 


l^OH 


7/2 ,3-7/2 ,3 

12.1 

1.659 

577 


H2^^C0 

II 3 . 8 -II 3.9 



1.7 

1.683 

540 


^^OH 

0*3/2 

F.Fj 

•3/2, 1-3/2. 1 

1 

1.684 

542 


I'OH 

F.F, 

•1/2.1-1/2.1 

1 

1.692 

7950 


^®0H 

\/Z J‘3/2 

F*2- 

1 

1 


Only ground vibrational state has been cons1<tered here. There are other lines 
that c(Hi1d be of astrophysical interest in excited vibrational states such as 
several lines of HC^N. 
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Why do we wish to bear forever the noise 
of these more than another noise so close 
to our dwelling place. 

—Robert Frost 


7.1 - Introduction 

The unprecedented combination of broadband coverage and high frequenc;y 
resolution In the multichannel spectrum analyzer and signal detector provide 
a means for previously unattainable sensitivity In signal analysis. The 
criteria used In the choice of nonastrophysical applications for the system 
are: 1) the signals to be monitored must have a broad total bandwidth, 
whidi contains narrowband spectral components of Interest, and 2) the signals 
must have a low signal to noise ratio. 

Some of the most promising non- as trophy si cal applications of the multi- 
channel spectrun analyzer and signal detector are In the field of acoustic 
or microselsmic monitoring. There are currently many ongoing research 
projects, and nwerous diagnostic Incipient failure tests and non- destructive 
tests employing acoustic technology, which are producing a wealth of useful 
data. The two major Impediments in this relatively new and very promising 
field are frequency resolution and acoustic emissions with low signal to noise 
ratios. A sensitive signal detector with high frequency resolution might 
Illuminate many of the microselsmic or acoustic emission phenomena that are 
only partially uncterstood at the present time. 

Acoustic or microselsmic emissions are transient elastic waves generated 
by rapid release of energy during dynamic processes In materials. Localized 
transient instabilities such as crack propagation, dislocation motion, and 
microfracturing In rocks and metals or fiber fracture and crack propagation in 
polymers and fiber-reinforced plastics (FRP) are ex»nples of the dynamic 
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processes which produce acoustic em1ss1of». The transient elastic waves 
or acoiatic emissions are detected with piezoelectric transducers which 
w«Mivert them to electrical signals whidi are then analyzed In a variety of 
ways tdtich will be discussed later. The sensitivity of acoi^tlc emission 
analyzers to localized transient 1rstal)111t1es allows prediction of catas- 
trophic failure. Although the stresses In the system are below the 
eli^tlc limit of the material* the localized stresses In t1^ region around a 
flax or crack may be higher than the elastic limit, due to stress concentra- 
tion, which results In propagation of the flaw or crack. In most material 
system catastrophic failures are often the result of numerous localized 
inste^mtles which coalesce, and therefore the ability of acoustic 
emission analysis to detect these localized failures which occur at stresses 
well below the elastic limit of the material makes prediction of catastrophic 
failure possible. 

In addition to failure prediction, acoustic emission technology has a 
variety of useful applications. Leaks In pipes, reservoirs, and dams have 
been successfully detected and located with acoustic emission analysis. Flaws 
In pressure vessels, reactor vessels, and a variety of other structures have 
been detected and located with a variety of acoustic non-destructive testing 
(NOT) methods. 

The use of acoustic monitoring was Initiated In the late thirties and 
early forties by Obert (1939, 1940, 1941), Obert and Duvall (1942, 1945, 1945a) 
Hodgson (1943) and Hodgson and Gibbs (1945), In studies of rock bursts and 
pillar failures In mines. This early work was crude and employed a low 
frequency resonant v^eophone, an audio amplifier, and a headset (audible 
frequencies were the only frequencies monitored). Kaiser (1950, 1953) and 
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his oworkers In Gennar\y In tiie earlier fifties are generally credited with 
the first rncKtem acoustic studies In work on metals, Kaiser reported that 
all n«ta1s. Including steel, zinc, copper, aluminum, and lead produce acoistic 
onlsslor^ under load. He also reported that acoustic phenomena were Irrever- 
sible, that Is, acoustic emissions do not occur In cyclic loading tests 
unless the load In the cycle exceed^ that In all previous loading cycles. This 
pehnomenom Is called the Kaiser effect. Schofield (1958) and Tatro (1959) did 
imjch to In^rove the sophistication of acoustic monitoring Instrumentation and 
experimentation In the mid to late fifties with i'esearch on the sources of 
emisslors. They found that acoustic emissions In metals were produced by the 
dislocation movement which accompanies plastic deformation In such materials. 
Dunnegcn Tatro, and Harris first investigated the use of Instrunentatlon 
wh(se frequency range was well above the audio range. The use of equipment 
with this range eliminated many of the problems early workers encountered 
with excessive background ambient noise. 

There are numerous present areas of application of ac(MJSt1c or microselsmic 
analysis. Brittle deformation processes In geologic materials have been 
greatly lllwninated by studies of microselsmic emissions. Scholz (1968) 
proposed that acoustic emissions in polycrystalline geologic materials were 
produced by microfracturing of individual grains, and that failure In such 
materials (by coalescence of microfractures) could be predicted on the basis 
of the acoustic count rate. This work is further supported by Martin and 
Durham (1975) and Dunning and Dunn (1978), in work on crack propagation In 
quartz. Lockner, Walsh, and Byerlee (1977) and Gupta (1973) have Investigated 
changes In velocity of acoustic emissions in rocks undergoing deformation. 
Byerlee and Lockner (1978) and Blade and Malone (1977) have Investigated the 
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acou$t1c nature of shear failure and fracture propagation. Byerlee and 
Lockner (1977) and Hardy (1975, 1978) have found that fluid or gas pressure 
In geological materials can be estimated w) the basis of acoustic activity. 

The acoustic nature of slope Instability In rock, soil, and snoM has been 
investigated by Koemer and Lord (1974, 1975) and Sommerfleld (1978). There 
are many other examples of acoustic applications in geologic problem solving 
which, for the sake of brevity, cannot be covered In this report. 

Acoustic emission techniques are also well -suited to probl^ solving 
studies In material research, structural integrity analysis, and otiwr areas 
of Industrial research. Liptal (1972) Investigated the acoustic nature of 
failure In fiberglass- reinforced epoj^. Tetelman, Harris and Darwish (1972) 
conducted similar experiments with whisker-reinforced conqjosites. Carlyle 
(1975), Hamstad and Chlao (1976), and Fowler and Gray (1979) have characterl zed 
tlte acoustic nature of failure In fiber- reinforced plastics. Their work 
Indicates that the different mechanisms Involved In failure of these 
materials; such as fiber failure and glass cracking, can be differentiated 
on the basis of acoustic signature, signal amplitude, and power spectra. 

Hamstad and Chlao (1976) and Mitchell (1979) have Investigated non-destructive 
tests fcr fiber-reinforced plastic structures and pressure vessels. Hartbower 
et a1 (1972) and Wingfield (1972) have demonstrated that weld-cracking can be 
detected and located, and that the structural Integrity of welds can be 
tested with acoustic emission techniques. Acoustic emission technology has 
been applied In the aerospace and aeronautics Industries with success by 
Moore (1970), Green et al (1970), and Nakamura et al (1972). Acoustic analysis 
of on-line nuclear reactor vessels has been emplqyed by the Connecticut Yankee, 
and Elk River power plants (Mhylle, 1971). The deformation of concrete and 
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and other building materials has been characterized acoustically by LI et al. 
(1970). The mining Industry has made extensive and successful use of acoustic 
monitoring technology. Leighton and Blake (1970) and Krauland and Westerberg 
(1977) have shown that rock bursts, roof bounces, and pillar failures can be 
predicted In some Instances on the basis of Increased acoustic activity 
pn’or to the event. 

7.2 - Acoustic Em1«;s1on Analysis Techniques 

Acoustic activity can be measured and analyzed in a variety of weys. 

Early workers employed threshold counting techniques, which consist of 
accumulating a running count of the number of times the amplitude of the 
signal from the transducer exceeds a predetermined threshold. It has been 
shown by numerous experimentalists that, in most poly crystal line materials, 
the count rate (counts per unit time) rises sharply just prior to failure. 

In some cases the use in count rate can be utilized to predict failure. This 
technique has some deficiencies, however, which limit its effectiveness. A 
single acoustic burst of high amplitude may exceed the threshold several 
times, resulting in a spurious value for total acoustic activity, while 
acoustic events with lower amplitude produce a single count. 

Furthermore, in materials that have been loaded in a cyclic manner, the Kaiser 
effect often precludes prediction of failure if the material fails at a 
load level attained In a previous cycle. 

Another technique commonly employed is burst or event thresholding, which 
entails accumulating the count rate of acoustic events or bursts above a pre- 
determined threshold. This method has the same limitations as simple thres- 
holding, however the count rate obtained in event thresholding is more 
accurate and not as sensitive to the level of the threshold. 


- 236 - 


The maximum signal amplitude during a single event Is a ctxnmonly employed 
accxistlc analysis technique. Signal mnplltucte measurements are useful In ^ 
laboratory where sample homogeneity Is not a problem or In the field when 
relative changes In amplitude as a function of time for a single transducer 
are of Interest, however In other cases signal amplitude a1or« Is not a 
particularly Informative parameter. 

A number of workers have Investigated the frequency characteristics and 
power spectra of Individual acoustic events and cimulatlve acoustic events. 
Suzuki et al (1964) and Chugh et al (1972) found that acoustic events In rocks 
undergoing deformation had a regular and predictable power spectra. Chugh 
et al (1972) found that the frequency of emissions shifts upward as a function 
of stress level, as the failure stress Is approadied. Chugh et al propose that 
spectral data was more useful when presented as an energy distribution ratio 
which. In that study, was defined as the ratio of acoustic energy In the 
range 500-5,000 Hz, to that observed in the range of 10,250-15 ,(M)0 Hz. In 
general, the energy distribution ratio decreases with Increased stress for a 
variety of rock types. Graham and Alers (1975) found that many metals and 
composite materials have specific and unique acoustic emission spectra. They 
also showed that It was possible to detect failure and Identify component 
noises In steemi turbine equipment based on the spectra of emissions. Fowler 
and Gray (1979) and others have shown that fiber failure, crack propagation 
In glass and other failure mechanisms In fiber- reinforced plastics can be 
identified by their spectra. A typical frequency-versus-amplltucte plot 
reveals such flaws. The major drawback of spectral analysis Is the lack of 
frequency resolution attainable with available off-the-shelf spectral analysis 
equipment. Most workers Involved with acoustic spectra feel that frequency 
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resolution Is tne R*.s1n Impediment to solution of a number of acoustic 
problems. Many materials have rather wide spectra (up to several MHz) 
which cannot be observed In total with very high frequency resolution. 

Source location analysis Is a recently developed acoustic technique 
that appears to be promising. Source location techniques employ an array 
of three or more transducers which are located at various points on the 
body of Interest. Acoustic activity from a source Is detected by each 
transducer at a different time depending on the distance from the source to 
the transducer. A triangulatlon technique based on the differences In arrival 
times onong the transducers Is used to locate the acoustic source 

empirically. Mogi (1968) and Scholz (1968) have used this technique on a 
variety of rock types with some success. Byerlee and Lockner (1978) have 

used the source location technique to map fracture propagation In granite 
in the most sophisticated geologic source location study yet attempted. 

Source location techniques at'e commonly used in Industry to identify flaws and 
fractures (Hamstad and Chiao, 1976; Mitchel , 1979; and Wingfield, 1972). Source 
location techniques are employed in mines to predict and locate rock bursts, 
Blake (1977) and Leighton and Steblqy (1977). An excellent description of 
this technique is found in Kelly and Schlamp (1977). 

Seismic velocity attenuation analysis is another promising acoustic 
technique. Lockner et al (1977) have (temonstrated that the seismic velocity 
of accMistic signals decreases up to 30% immediately prior to failure in 
granite. They attributed this attenuation to the presence of numerous micro- 
cracks. Hadley (1975), Merkulova et al (1972), and Bom (1941) have also 
Investigated this phenomenon with similar results. It has also been observed 
that seismic velocities are often attenuated In the area of the focus just 
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tite signature of acoustic emissions ger«rated during sliding friction 
tests on granite was similar to the signature of some earthquakes. 

7.3 - Possible Applications of the Signal Detector In Accxistic Emission 

The two areas of Interest In acoustic monitoring which might be greatly 
Illuminated by the multichanne'' spectrum analyzer and signal detector are 
spectrum analysis and low SMI signal analysis. Spectrum analysis at the 
present time Is limited by frequency resolution. The acoustic spectra of 
many materials undergoing deformation have bandwidths of a megahertz or 
more, while the frequency resolution of presently available off-the-shelf 
equipment with bandwidths in this range is on the order of 10 kHz. There 
are undoubtedly numerous narrowband acoustic spectral components that cannot 
be effectively analyzed with 10 kHz frequency resolution. It can be shown 
that the effective SNR of a signal whose bandwidth is narrower than the 
frequency resolution of the analyzer becomes 

CMD_§§_ where Fr = frequency resolution of the analyzer 

5'*'^FF7FF (7-1) 

Bs • bandwidth of the signal 

This means that narrowband signals of interest may be lost because of poor 
frequency resolution. The frequency resolution of the multichannel spectrum 
analyzer is high enough to effectively investigate narrowband acoustic 
spectral components. 

There are numerous instances, especially in filed projects, when the 
ambient noise is of such great amplitude that many acoustic signals of interest 
are well below the detection threshold. Acoustic analysis of mines, under- 
ground reservoirs, and oil fields is often hindered by high amplitude noise 
and a low SNR as are laboratory investigations of the Kaiser effect, crack 


and fracture propagation, and hydrofracture. The high sensitivity signal 
detector proposed in this study would greatly enhance our ability to effect- 
ively Investigate such phenomena. Some specific types of investigations In 
industry and geophysics that would be greatly enhanced by the multichannel 
spectrum analyzer and the signal detector will be covered in the next two 
sections. One limitation of the system, however, is the low time resolution 
of the spectrum analyzer. Transient emission of much less than a second in 
duration would not be detected by the signal detector unless the pipeline 
capability of the spectrum analyzer could be implemented. 

7.4 - Industrial Applications 
Source Mechanism Studies 

It has been found by McCabe and Koerner (1979) that the frequency of 
acoustic emissions produced by crack propagation In coal Is shifted downward 
as the crack propagates, and the source area is enlarged. A rigorous analysis 
of this apparent phenomenon could result In an effective method for monitoring 
acoustic sources in order to determine if the flaws producing acoustic signals 
are growing. The sensitivity and frequency resolution of the proposed signal 
detection system would be ideal for such an investigation (an experimental 
procedure outline for a feasibility study of this phenomenon is presented in 
Appendix A7.1). Source mechanism studies of bearing failure in rotating equipment 
have so far shown some promise. The spectra of emissions of a bearing under- 
going failure can be used to identify the type of bearing, and source location 
analysis can locate the particular bearing of interest (Pollock, 1979). Source 
mechanism studies are also of importance in pressure vessels and nuclear reactor 
vessels. The location, spectral properties and amplitude attenuation of acoustic 
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signals generated by a flaw or fracture, or the velocity, frequency* and 
ainplltude attenuatloi (If the coupling parameters are well understood) of 
Induced acoustic signals may allow workers to Identify and characterize flaws 
In such vessels during on-line operation. The signal detection system 
proposed In this study and the multichannel spectrum analyzer would be capable 
of such studies. 

Failure Mechanism Studies 

The failure mechanisms In metals, plastics, composite materials, and 
building materials may fall into specific categories based on spectral 
components, signal amplitude and spectral shift characteristics. It has 
been shown, for Instance, by Fowler and Gray (1979) that fiber failure 
can be differentiated from glass cracking In fiber-reinforced plastics 
based on relative amplitudes and spectral components. Further Investigation 
employing a sensitive detector with high frequency resolution may show that 
Intercrystal line fracture, grain boundary deformation, and dislocation move- 
ment for various types of materials can be uniquely characterized by the 
parameters listed previously. High resolution and high sensitivity studies 
such as these might also reveal that there Is a unique frequency component 
for each material that appears Imredlately prior to failure, the knowledge of 
which would greatly enhance the ability to predict failure. It has already 
been established that the flow mechanisms In plastics can be differentiated 
and Identified by characteristic signatures of acoustic activity In frequency- 
time space. The signal detection system and multichannel spectrum analyzer 
proposed In this report could, quite possibly, provide extremely In^ortant 
Information on failure mechanism characteristics. 
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Leak Detection 


It has bee:i demonstrated by Parry (1971) In pressure vessels and pipes 
and by Koerner and Lord (1974) In earthen dons, that leaks can be detected 
by analyzing the spectral components and frequency ratios of acoustic 
missions. Leaks are often manifested by a single or group of specific 
spectral conponents, the characteristics of which are apparently dependent 
on the pipe, vessel, or dam material and the leaking medium. The signal 
amplitudes of acoustic emissions resulting from leaks are often quite low, 
a problem which could be ameliorated to a great extent by the sensitivity 
and frequency resolutim of the proposed signal detection system. 

Kaiser and Felicity Studies 

The apparently Irreversible nature of acoustic activity was discovered 
by Kaiser (1950). The Kaiser effect is the immediately irreversible nature 
of acoustic emission activity resulting from applied stress. If the effect 
is present, there is little or no acoustic activity until previously applied 
stress levels are exceeded. This effect is not extremely well -understood for 
some materials. The Kaiser effect in rocks and concrete may be due to the 
fact that acoustic emissions are produced by microfracturing, and in cyclic 
loading now microfractures are not produced until previous loads are exceeded. 
Previous investigations of the Kaiser effect have employed thresholding tech- 
niques which may not have detected low amplitude signals which possibly occur 
under cyclic loading at stresses below previous stress levels. An investigation 
of this phenomenon with the multichannel spectrum analyzer and signal detector 
may uncover low amplitude signals or characteristic spectral components that 
will enable us to better understand the Kaiser effect. 
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The felicity effect Is a phenomenon which occurs In plastics under 
cyclic loading. The felicity effect results In significant emissions at 
stress levels well below previously attained stress levels. It Is believed 
that the felicity effect Is caused by Isoelastic deformation during 
unloading which effectively anneals previously formed deformation structures. 
A highly sensitive high frequency resolution analysis of this phenomenon mqy 
reveal the deformation mechanism which produces the felicity effect. 

Thermoplastic Studies 

The failure of thermoplastics and glasses cannot presently be predicted 
with acoustic monitoring techniques because acoustic emissions are either 
absent or low enough In amplitude to escape detection. Glass sometimes 
displays acoustic activity during crack Initiation, however, this only occurs 
when the crack propagation Is unstable. The high frequency resolution and 
sensitivity of the multichannel spectrun analyzer and signal detector pro 
posed In this study may enable detection of acoustic emissions In these 
materials. If they are present. 

There are, undoubtedly, numerous other Industrial applications In the 
field of Industrial acoustic emission monitoring for which the signal detec- 
tion system proposed in this study may be well-suited. The applications 
discussed In the section are, however, of particular Interest In this field. 

7,5 - Geophysical Applications 
Fracture Density Studies 

Fracture density Is of great Interest to geophysicists, structural 
geologists and petroleun geologists. Fracture density In fault zones appears 
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to be an Important parameter In fa"U mechanics, a clear understanding of 
which may lead to significant advances In the field of earthquake prediction. 
The density and nature of fractures In petroleum reservoirs are two of the 
most Important factors In effective tertiary recovery of oil In fractured 
reservol rs. 

Present fracture density studies are Implemented In a variety of ways. 

Many geoF^ysIcIsts and petroleum geologists Investigate seismic velocity 
attenuation. In this test an ultrasonic or shot blast signal Is activated 
and the travel times of the elastic waves from the shot site to the sites are 
recorded and compared to the predicted travel times for the unfractured rock 
type of Interest. If the fracture density Is high, the velocity of the elastic 
waves will be greatly attenuated from the predicted value. This method has 
some limitations, most notably sensitivity to fracture filling. A fracture 
which Is open and filled with gouge or fluid will produce the same velocity 
attenuation as numerous closed fractures. Another parameter measured by 
geophysicists and petroleum geologists and engineers Is the Q factor which Is 
basically a measure of the energy loss of an Input ultrasonic or shot signal. 

A highly fractured body of rock will have a lower Q value than unfractured 
rock of the same composition. This method has the same basic limitations as 
velocity attenuation because the parameters used to calculate both are basi- 
cally the same. Measurement of amplitude attenuation as a function of 
fracture density has been attempted In the laboratory, however the uncertainly 
involved in determining the coupling parameters of the transducer and the rock 
makes field utilization of this technique somewhat difficult. Power spectral 
analysis has been employed with some success in recent years. It has been 
found that high fracture density results In a downward frequency shift of the 
power spectrum. 
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The frequency and amplitude resolution of the multichannel spectrum 
analyzer and signal detector alloM more detailed and sophisticated analyses 
of fracture density* Wider bandwidth Input signals can be employed and 
frequency attenuation, amplitude attenuation (If the coupling parameters are 
knoMn), and detailed poMer spectra measurements would be possible. An 
analysis of Q as a function of frequency might also be productive because 
Aki (1979) has found that Q varies as a function of frequency to some 
extent. In addition to varying as a function of fracture density. 

Stress Field Analysis 

Accurate measurements of stress fields and stress orlentatlms are of 
great Importance to geophysicists, petroleum engineers, and mining engineers. 

A sudden change In the stress field of a fault zone may be a premonitory 
event, or an unexpected variation of the stress field with depth may result 
in expensive drilling equipment damage in an oil well. Rock bursts and other 
mine failures are often preceded by rapid variation in stress field Intensity 
and orientation. Presently such techniques as overcoring, hydrofracture 
strength analysis, and extensometer and strain gage analysis are used to 
measure stress field orientation and magnitude. An interesting application 
of the multichannel spectrum analyzer and signal detector would be an analysis 
of the acoustic emission activity, under loading, of cores cut from fault 
zones, mine walls, or drill cores order to determine the stress field by 
investigation of the Kaiser effect. The Kaiser effect could be used as a 
stress level Indicator of great sensitivity. One major limitation of the use 
of the Kaiser effect to determine stress levels is the fact that the stress 
values calculated by determining the stress level at which acoustic activity 
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begins may be higher than the actual stress level. If the stress level In 
the rock was significantly higher at some time shortly before the cores 
were cut. Kaiser effect studies should, however, be undertaken because the 
spectrum analyzer and signal detector are well -suited for this type of Inves- 
tigation and Kaiser effect measurements may prove to be an excellent and 
Inexpensive measurement technique. An experimental procedure outline for 
a feasibility study of an Investigation of Kaiser effect as a stress 
Indicator Is presented In Appendix A7.1. 

Source Mechanism Studies 

It has been established that the source mechanism of acoustic emissions 
In metals and plastics can be determined on the basis of frequency and ampli- 
tude. Fiber fracture, dislocation movement, crack propagation, and grain 
boundary deformation all have different amplitudes and spectral components. 

It Is probable that the sane situation exists In rocks; source mechanisms 
such as dislocation movement, microfracturing, grain boundary deformation, 
and Inter-crystalline crack propagation should be distinguishable on the 
basis of spectral components and relative signal amplitudes. This has not 
been Investigated as of yet because the spectrum of acoustic emissions In 
rocks Is wide enough In bandwidth to preclude high resolution spectral 
analysis with available equipment. 

Another type of source mechanism study that would be productive Is 
spectral analysis of a propagating flaw or fracture. NcCable and Koemer 
(1979) have found that the spectrum of emissions from a propagating cleat 
In coal shifts downward In frequency as the cleat dimensions Increase. This 
apparent phenomenon should be Investigated with a high resolution detection 
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system such as the one proposed In this study because If this phenomenon 
exists and can be clearly understood, significant progress may be made In 
prediction of pillar failures and rock bursts In mines. 

General Deformation Studies 

The acoustic nature of deformation processes In rocks Is not very well 
understood at the present time due In great part to the broad bandwidth of 
the emissions and the lack of frequency resolution of existing acoustic 
monitoring equipment. Nianerous types of Investigations of rock deformation 
could be carried out with the signal detection system propcsed In this 
report. Compressive and tensile failure tests might Illuminate the acoustic 
nature of the shift from microfracturing to cataclastic failure In rocks. 
The acoustic nature of hydrofracturing could well be delineated In a more 
rIgorcHiS manner with such a signal detector. Cyclic loadlr^ tests might 
Illuminate the acoustic nature of failure under cyclic loading. There are 
undoubtedly numerous additional studies that could be Implemented with the 
multichannel spectrum analyzer and signal detector proposed In this study 
that would greatly enhance our knowledge of deformation pr(x:esses In rocks. 
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Postlude 
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The operator powers up the Oasis system, mounts fresh tapes on 
the rep lay/ arch i ve drives, and places the system in self-diagnostic mode. 
Momentarily, the status display reports that all subsystems are functioning 
properly. 

Via a magnetic tape, the operator enters into the RFI 
catalog data concerning astrophysical signals in the portion of sky to be 
covered today. He now activates the local RFI recei ver which begins to 
dynamically update the catalog. 

Pointing the telescope at a known quiet portion of the sky, the 
operator places Oasis into threshold-setting mode. The control CPU makes 
minor adjustments to thresholds in the carrier wave detector, pulse 
detector and cluster detector systems. 

On this particular day, a radio astronomer is on hand to continue 
a high resolution OH survey along the lines of sight to be searched. 

She takes her seat at the radio astronomy terminal and loads a program 
on her minicomputer. 

The SETI operator, meanwhile, is specifying to the control CPU the 
protocol for making automatic entries to the system log and event archives, 
for generating graphic displays and audible alarms. 

When all is ready, he points the antenna to the first star (and any 
unseen surrounding system of planets) and the search begins. 

Digital data from multi-channel spectrum analyzers (MCSAs) connected 
to two orthogonally-polarized receiving systems coiimences to flow at the 
rate of 8 million complex spectral values per receiver per second in four 
parallel data streams into the NBIT detector, the pulse detector, the 
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carrier wave detector, and a 100 g1ga-b1t tape memory. In the MBIT system the 
stream divides again to supply 12 processors each at the same data rate: 

32 megaMords per second. The I^IT 1oM-thresho1d feature detectors are 
rKM providing the clustering system's histogram unit with numerous patterns 
to count. The actual cluster search will begin at the end of the 1000-second 
observation of this star. 

At the first 20-second decision point, the carrier wave 
processor makes several detections, but the control CPU matches these with 
known military radar signals in the RFI catalog so no further action is 
taken on them. Nor does anything unusual appear in the coarse- resolution 
spectrum the operator has chosen for his display. 

Two minutes into the search, the audible alarm sounds and an integrated 
coarse spectrum of somewhat different resolution appears on the graphics 
display. A small but noticeable peak is seen in the middle of the spectrum. 
H)e status display indicates that the binning system has made a detection 
and the moving system log and event archive tapes indicate that information 
is being written on them. 

It looks like a maser--a previously unknown one. "Do you have a maser 
on your screen?" he asks the radio astronomer. "Yes I do, now," she replies. 
"You should record it." "It already has been." 

He enters a comment in the system log. Over the remainder of the 
observation of the star the maser line shows more and more clearly. The 
operator elects to save the entire record on the total observation archive. 

The next few stars reveal nothing unusual. Later in the afternoon, 
the clustering algorism finds some extremely unusual concentrations of 
broadband pulse features in an observation of Epsilon Euridoni. The SETI 
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operator saves the total observation for further analysis, along with 
an image of the cluster as it appears on the display. 
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Detection Is, or ought to be, an exact science, 
and should be treated In the same cold and un« 
einotlonal manner. You have attempted to tinge It 
with romanticism, which produces much the same 
effect as If you worked a love-story or an elope- 
iwnt Into the fifth proposition of Euclid, 

— A.C. Doyle, The Sign of Four 


Appendix A2.1 

Ihfi Discrete Fourier Transform of Signals and Noise 


The Discrete Fourier Transform 

Assume that we have some time process x(t), either deterministic or 
stochastic, which ls sampled every T/N seconds where T Is the length of one 
complete record or realization, and N Is the number of samples per record. 

This Is Illustrated In Fig. A2.1.1. (In the MCSA which Is a part of the 
system under study here, two quadrature time signals are sampled in this way.) 



Figure A2.1.1 - Sampling a Continuous Time Function 
The Discrete Fourier Transform is defined as: 

Ti *lc , n = 0,1,2 N-1 (A2.1-1) 

If the X|^ are samples of a real baseband process with a maxltmjm 
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frequency w, the sampling theorem requires that the sampling frequency be 
at least 2W» or that In T seconds the nunfeer of samples be at least 2WT. 

If the X|^ are samples of a baseband process obtained from a bandpass process 
of bandwidth By, the number of samples required In time T Is at least 26jT 
but these samples are of the real and Imaginary parts of the complex 
envelope of the bandpass process, where these parts are sampled once every 
seconds. This Is called quadrature sampling (Peebles, 1979). 

The bandpass signal Is detected by multiplying It In parallel channels 
by cosW^t and slnW^t, where Is a constant angular frequency, and thus 
yielding the real and Imaginary parts of the complex envelope. These two 
signals are low pass filtered and then sampled to produce the real and 
Imaginary parts of the complex samples Xj^ used In Equation (A2. 1-1). 

The values of generated by Equation (A2.1-1) are complex numbers 
corresponding to the signal which Is In each of N frequency bins. (If, for 
example, a bandpass signal of bandwidth 10 MHz is to be divided Into one 
million 10 Hz channels, one million samples will be taken of the real and 
Imaginary parts of the complex envelope, and one million values of will 
be otained (N * 10*).) 


The bandwidth of each individual bin is: 


B * Sy/N (A2.1-; 

The center frequency of the n^*’ bin is: 

f = "®T = nB = «!J- , n = 0,1,2 N-1 

" IT ”'s 

where T^ = T/N is the sampling time which equals By^. Hence the output of 
the OFT is a set of N complex numbers each one of which is associated with 


a frequency bln with bandwidth B and center frequency f^. The baseband 
representation of this result is shown in Figure A2.1.2. 
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Response of the DPT to Sinusoid Signals 

Next we consider the case where the Input x(t) to the DFT Is a complex 
sinusoid of the form 

x(t) • 

where Is a fixed baseband frequency. A signal of this form corresponds 
to a sinusoidal signal in the original passband. In order to conveniently 
relate cyclic frequency f^ ( • W^/2ir) to the frequency bins shown In 
Fig. A2.1.2,we define f^ in terms of the blnwidths B as: 

FrArB (A2.1-5) 

Hence, for example, f^. lies within Bln 2 whenever 1.5 < r < 2.5. We have 
then; 


,(t) . 


(A2.1-6) 


Since the sample Is taken at the time t » kT^, the k^^ sample of 
x(t) Is: 


x^ . Ae 


(A2.1-7) 


Using Equations (A2.1-3) and (A2.1-5), x. can be wHtten as: 


Xk - Ae 


kr 


(A2.1-8) 

If this sampled signal Is now Inserted Into the DFT, Equation (A2. 1-1) 
becomes : 

(A2.1-9) 


j N-1 j|^(r-n) 

K • h T. Ae 
n H 
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Consider first the simple example where r ■ n, corresponding to an 
f^ In the center of bln n. The effect of the summation In Equation (A2. 1-9 ) 
Is simply to add A a total of N times so that equals A. Thus the 
transform has a ‘‘gain" of unity In center band. 

Next c(xis1der a signal x(t) which has the center frequency of Bln n 
but which has a phase angle 6. That Is: 


x(t) « Ae^^“n^ * (A2.1-10) 

Then, clearly: 

^n ^ (A2.1-11) 

Hence X^, the complex number out of the DFT has the same amplitude for an 

Input signal of any phase, and has the same angle as that of the Input. 

Let us now generalize to the case where r takes some general value, 

not necessarily equal to n. Define d to be the difference between r and n: 

d 4 r - n (A2.1-12) 

Assume now that the Input signal Is: 


x(t) . 

jS kr 

Xk = Ae e N 


so that the DFT is: 

X * i Ae 


:2iT 


1 iuJe Jn 
n F 


Z e 
k*0 


Ae^® 

N 


L 

k*0 


k 1 - a*^^^ 

and since Z a « — * 


k*0 




k 


(A2.1-13) 

(A2.1-14) 

(A2.1-15) 

(A2.1-16) 

(A2.1-17) 
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(A2.1-18) 


je 


1 - e 
1 - e 


j2iTd 


^jrr{d-d/N) simrd 

— ® - s w m 


If N is large, which is our case, can be approximated by; 


Xjj = Ae'^®e'^’^‘*sinc d 


vdiere: 


sine X A 


simrx 

irx 


Hence the magnitude and phase of X^ are respectively: 


|X^1 = A sine d 

A * ® * 'd 


(A2.1-19) 

(A2.1-20) 

(A2.1-21) 

(A2.1-22) 

(A2.1-23) 


These outputs (response characteristics) are plotted in Figure A2.1.3 for 
e equal to 0. 




a. Amplitude Respemse 


b. Phase Response 


Figure A2.1.3- Response of DFT to Sinusoidal Signal 
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It is apparent from Equations (A2.1-2;j) and (A2.1-23) and Figure A2.1.3 
that the response of the DFT to a sinusoid which is not in the center of the bin 
will be less than unity in amplitude, and will introduce some phase. 

We consider next the relationship between as obtained from one set 
of samples (one record) of x(t) and the obtained from the next record, which 
is obtained T seconds after the first. In Equation (A2. 1-7) substitute kT^ + T 
for kT^. 

X, = Aej2-»^{kT3 - T) 

= (A2.1-24) 

since BT = 1 and = 1. 

Since the values of X|^ in Equations (A2. 1-7) and (A2.1-24) differ only by 
the term which does not depend on k, the resulting DR's will differ 

only by this factor. Thus the angle of obtained from the second record 
will differ from that of the first by 2trd. 

The implication of the above result is that any phase coherence which 
the sampled signal possesses will be lost in the DFT operation if the signal 
is not in the center of the band. If, however, an X^ is observed which 
changes phase angle linearly as a function of time but does not change in 
amplitude, then it can be deduced that the input signal is a sinusoid of 
frequency 

f - f + IMi 

r n 2tt (A2.1-25) 

where A0 is the change in the phase of X^ over T seconds, that is from one 
record to the next. A first order indication of a linear change in phase is 
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an equal phase difference between successive pairs of X^'s. That Is: 


Sg - 20J + 0Q - 0 (A2.1-26) 

The test Implied by Equation (A2.1-26) could be in^lemented quite easily to 
search for constant frequency signals as processed by a DFT. 

Me consider next the case of a sinusoid which drifts linearly In 
frequency as a function of time. That Is, the Instantanemc frequency Is: 


w. = + wt 

1 r 


(A2.1-27) 


where H Is the drift rate in radians per second. The corresponding 
signal is: 

x(t) . * “-•t * e) 

Three cases will be examined depending on the value of W. 


(A2.1-28) 


Case 1; Small W 

We assume first that the instantaneous frequency is nearly constant 
within the bin of interest over one record, of length T seconds. We express 
this condition as: 


-T « B 


W « 


2ttB 


“2? ■' ' " T (A2.1-29) 

In this case the frequency input is essentially constant over one set 
of data. If that frequency is W^, the output is approximately equal to 
that given by Equation (A2. 1-191. 
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Case 2; Intermediate W 

We start by assuming that the record begins at t « 0 so that the 
Sfflnple Is again taken at t = kT^. The sampled Input signal Is then: 


= AejV^“»*^s + Wkn|/2) 


(A2.1-30) 


and hence: 


. Ae'*® * 2FTF ''') 


*n' N 


z e 
k»0 


(A2.1-31) 


The effect of the second term (due to frequency drift) In the exponent 
Is to shift each of the k complex terms In the s«n by the angle Wk*/2B*N^. 

If this shift Is small compared with 27rdk/N, we can neglect It, and we are 
then back to Case 1. But now we are considering the case where this term is 
too large to neglect. The summation in Equation ( A2. 1-31) is not tractable. 
Our approach is to obtain an approximation by calculating for a number of 
fixed values of frequency over which the signal drifts in T seconds, and 
averaging over these X^. That is, since X^ for a fixed frequency is given 
by Equation (A2.1-20), Xp for a drifting frequency is approximated by 

j,d„ 


K = M 

n M 


Where: 


I e'* sine d. 
m=0 

W mT 


m 


'^m ^ M-1 


(A2.1-32) 

(A2.1-33) 


As an example to illustrate this approach consider a signal whose frequency 
drifts from f^ (d = 0) at t = 0 to f^ + B (d = 1) at t = T. Let M = 3 so 
that the fixed-frequency X^ are obtained for d - 0, 1/2, and 1. These 
values are: 

X = Ae'^® 

Ho 
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X„ » Ae^®(0) 

112 

and average Is: 

= 0.33 + j0.21 

This procedure was repeated for a number of values of M with the results 
shown in Table A2.1.1. 

M Yn/Ae-^^ 

4 0.3017 + jO.2686 

8 0.2615 + jO.3348 

12 0.2492 + jO.3536 

24 0.2373 + jO.3713 

100 0.2284 + jO.3840 

Table 72.1.1 - Approximate X^ for 0 £ d £ 1.0 

The results approach an asymptotic value. We. shall accept the value 
for M = 100. That is: 

X = Ae^®(0.23 + jO.38) 

= Aej^0.4 4/58.8° 

This suggests that a drifting signal is definitely attenuated by a 
significant amount if it drifts from center bin to an edge in one observation 
time. 

We next consider the case for a signal which drifts from the center of 
a bin to a point part way to the edge (0 < d < a). The resulting approximate 
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values of were calculated by an averaging process paralleling that 
suggested by Fq. (A2.1-32). The resulting values of are given in 
Table A2.1.2 for four values of a. 


Drift Range 


0 < d < 1.0 

0.44/58.8' 

0 £ d £ 0.5 

0.79/41.4' 

0 £ d_£ 0.2 

0.96/16.9' 

0 £ d £ 0.1 

0.99/8.1° 


Table A2.1.2- Approximate X^ for Various Drift Ranges 

These results show that the DFT can accommodate a certain amount of 
drift per record, up to perhaps 20% of the binwidth, without serious 
degradation. However, a drift from the center of the bin to an edge clearly 
produces a very significant attenuation in the DFT output. Clearly it is 
desirable that the relative drift per record (compared with the bandwidth) 
be kept small to maintain the basic system sensitivity. Two strategies are 
available to keep the relative drift small. These are: 

a. Increase the binwidth B 

b. Change the center frequency of the bin, f , as the 
signal drifts. 

The first strategy has the disadvantage that it increases the amount of 
noise passed by the bin. The second strategy has the disadvantage that it 
requires a method of forcing f^ to track the drifting input signal. 


Case 3: Large W 

Next we consider the case in which W is very large. Specifically: 


W 


2i“ 


•T » B 


W » 


2ttB 


(A2.1-34) 
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This means that the carrier signal sweeps through the bln In a time which 
Is short compared with the record time T. 

The problem Is simplified by assuming that the response Is due only 
to the Input while the carrier Is within the bln (I.e. -1 £ d £ 1). This 
Is equivalent to assuming In Figure A2.1.3a that Xn * 0 when |d| > 1. In this 
case can be approximated by: 

iuJ® M-1 . . 

’'n “ JfiT ® *^m (A2.1-35) 

fTl“ -W+i 


The solution to Equation (A2.1-35) Is easily found from the solution to 
(P2.1-32) since the values of the X^ contributions for negative m are just the 
complex conjugates of contributions for positive m. The resulting value of 
X„ for M = 100 and -1 < d < 1 Is 

X„ s fle-j® X 0.22 
n 

This result assumes that the time required for d to sweep from -1 to +1 Is 
just T seconds, the record length. The time required for the carrier to 
sweep from a value corresponding to d = -1 to d = 1 Is: 


'2d 


- 2B X 2tt 

R 



If average value of X^ is: 


0 . 2 ? 


T 


(/>2.1-36) 


(A2.1-37) 


= ij76g^je (P2.1-38) 

U * ^ 

Hence the amplitude of the DFT output is reduced to as little as 22% of the 
non-drifting case, and can be much less If the drift Is quite rapid. 
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Response of the DFT to Noise 

Next we consider the effect of the DFT on noise. Me assume that the 
Input Is Gaussian White noise with zero mean and power spectral density 
We assume that this noise Is passed through an Ideal bandpass filter of 
bandwidth B^. The resulting bandpass noise can be written In the Rice 
representation (Vlterbi, 1966) as: 


n(t) = n (t) cos W t + n (t) sin W t (A2.1-39) 

v» V i w 

where W^ Is the center frequency of the bandpass filter, and n^(t) and n^(t) 
are low frequency Independent Gaussian random processes. The variances of 
n(t), n^(t) and n^(t) all equal NqBj, and we shall call this variance a*. 

V Vt (A2.1-W) 

c s 

The quadrature detection process used in the receiver yields the complex 
low frequency (envelope): 

x(t) = n^(t) + jn^(t) (A2.1-41) 

The kth sample of x(t) is 

\ ^*^k (A2.1-42) 

where aj^ and bj^ are defined respectively as the kth samples of n^(t) and 
n^(t). Since the original processes n^(t) and n^(t) are Gaussian, the 
samples aj^ and b|^ are likewise Gaussian with zero mean and the same 
variance 

^a ” ^b " (A2.1-43) 


The DFT of the complex samples x^^ is 

N-1 


Y - i r V - n 1 9 Ml 

k“0 


(A2.1-fl4) 
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k-0 " k-0 


(A2.1-45) 


C 4 ««a nk/N _ _ 2ffnk 2irnk 

Since e *' * cos - jsin -j^ — 

Equation (A2.1-45) can be written as 

V — 1 1 ^ * 2Trnk x k e<n 2TTnk \ X r fh rft< ^?m!L 

" <*k“* T- * •’k*'" “IT“ > * JH 1,^ •'’k"' TT- 


(A2.1-46) 


a^sln ^ ) 


This result can be written as: 

_lN-l 


N-1 




k=0 


k*0 


(A2.1-47) 


(A2.1-48) 


where: 


^ - > i.«c 2rrnk . . . . 2nnk 


A - k 2irnk ^ 2iTnk 

k ” O|^cos ■ |j" “ “ 3|^sin 


Clearly Cj^ and dj^ are Gaussian random variables since the sum of Gaussian 
random variables Is Gaussian (the cosine and sine terms are just constant 
weights for any given k). Similarly the two sums In Equation (A2.1-48) are 
Gaussian, and hence is a complex Gaussian random variable. It is easily 
shown that the mean values of Cj^ and dj^ are zero. The variance of Cj^ is: 

“cij ■ Ti~ ^k®'" N * * 

= E(a,,^)cos^ C(b|^*)sin‘ ^ 


= o 


2 



+ Sin* 


2Trnk \ 
N • 


- 0 


2 


(A2.1-49) 
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Since the C|^ are assumed to be Independent over k, the variance of the sum 
of the C|^ is the sum of the variances. Hence the variance of times the 
first sum is The same arguments apply to the second sum. In summary* 
is a complex random variable whose real and imaginary terms are 
independent Gaussian with zero mean and variance o^. Equation (A2.1-49) can be 
written in polar form as: 

(A2.1-50) 

We review next the well-known (Papoulis, 1965) distributions of the three 

raiiJjm variables R^, 6, and e 2P 
n n 

The random variable R^ is Rayleigh, 

f(Rn) = ^e''^n"/2a2 ^ 0 < < « , (A2.1-51) 

The angle 0 is uniformly distributed. 

f{6) ~ » -n £ 0 £ Ti (A2.1-52) 

fhe random variable P, which is the pov^r of X^, is exponentially 
(or Gamma) distributed: 

f(P) = -^ (A2.1-53) 

We now have the statistics of the output of the DFT when the input is 
Gaussian noise. Next we turn our attention to the statistics of the sum of 
a number of DFT outputs. This sum can be taken directly on the X^, or on 
the power P, if a square law detector is first applied to the X^. Consider 
the former case first. 

We assume that the X^ being sunviied are independent complex Gaussian 
random variables. Let the sum of M values of X„ be called S. 
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(A2.1-54) 


S 


M-1 
Z X 
n*0 


n 


The real and Imaginary terms add but since these are Independent and 
Gaussian, the two terms are both Gaussian, with mean zero and variance Mo^ . 
Hence S is still a complex Gaussian random variable. Its amplitude, angle 
and power are given by Equations (A2.1-51) - (A2.1-53) wher^ o* is replaced by 
Mcr*. 


this sum 


We next consider the sum of M Independent values of power P. 
M-1 


^M 


I P. 
m»0 


m 


We call 
(A2.1-55) 


Where the P^^^ are the powers which are to be added. They are exponentially 
distributed as given by Equation (A2.1-53). The mean and standard deviation of 
P both equal a*. 

The sum of M Independent exponentially distributed random variables Is 
known to follow the Gamma distribution (Papoulls, 1965). 




P 

"2M” 

o 


M-1 

M 


(M-1)! 


g-PM/o* 


(A2.1-56) 


where the mean and standard deviation of Pj^j are: 
E[P„] .Mo' 


(A2.1-5?) 


Op * M 


(A2.1-58) 


The Gamma distribution becomes Increasingly Gausslar-llte as M 
Increases, that Is, as the number of sunned terms grows. This Is as one would 
expect from the Central Limit Theorem. 
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Note that If the In Equation (A2.1-55) represent signal power, and 
hence are Identical for each m, the total signal power Is MP^. That Is, 
the signal power grow In proportion to M, On the other hand, the standard 
deviation of the total noise power Is growing as ✓fT, as Equation (W. 1-58) 
shows. 

Hence the ratio of the total signal power to the standard deviation of 
the total noise power Increases In proportion to vfT, v^ereas the SNR 
increases In proportion to M If the sunmatlon Is carried out on the 
amplitude Instead of the power. Thus It Is more desirable to carry out the 
summation over amplitude values If this Is possible. Unfortunately, this 
cannot be done If the amplitudes are Independent and random, or appear to 
be so because of the action of the DFT In the MCSA. 
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Appendix A3.1 


Polarization Synthesis 


Given a desired type of polarization, as specified by the angles 
5 and Y* what are the weighting factors needed to combine the outputs of 
two orthogonal antennas In the form 
z ■ Kj X + K 2 y 

In such a way as to synthesize an antenna of that polarization? 

In general, Kj and Kg are complex constants. Writing them In polar 
form, we have 

Z « X + iKgle^®^ y 

The value of 6 is the (specified) phase angle difference that a 
signal will have between the x and y antennas. To optimize reception, 
this phase difference must be cancelled, which gives the relation, 

6 + (0 0 ^) = 0 

We are free to choose any values of 0 g and 0 j, which satisfy this 
equation, so let us choose for simplicity, 

Oj = 0 and 0 g = 6 

The value of y defines the relative magnitudes of the received 
signals, in the form 

^ 1 ^ 2 . I 

tan Y = I 

We are free to choose any values of Kj and Kg that satisfy this equation. 
Therefore let us also impose the condition that power be conserved by the 
confining operation, or in other words. 
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Solving these two simultaneous equations for |K|| and jK^j. we obtain 

|Kj|» cos Y 
IKgl* sin Y 

Thus the required equation for combining the x and y outputs to match the 
polarization type ( 6, y) Is 

Z = X cos Y + y sin y (A3. 1-1) 

From (3-1) and (3-2) we can obtain; 

cos Y = 1 ^1 — 

sin ^ * bd 

/RpTTTp 

eJ-S , g(^ - /x) 

Then substituting these measured quantities Into equation (A3. 1-1), we obtain 
final ly , 

7 . L- ' l X * -=jlL=^ y (A3. 1-2) 

/|x|= * |y|' /lx|" * |y|' 

If the outputs of the x and y antennas are combined according to this 
equation, the result will be the same as if a single antenna that was 
exactly matched to the polarization type of the Incoming signal had been 
used, regardless of the polarization type of the signal. 

Effects of Noise 


Thus far, the effects of noise have been neglected, for the sake 
of simplicity. Noise causes the measured polarization type to have some 
probable error, and therefore the synthesized polarization type does not 


exactly «atch that of the sl^ial. Befbre discussing this effect 

explicitly. It Is Interesting to note that equation (A3, i-2) reduces to a 

nearly trivial equatlra In the coeplete absence of wise. If m write 

X » |x|e^^ 
y = |y|e^^ 

TN»», 

Z*-==i= 

4*1* + |y|* 

Z * r e^^ ( lx|2 + |y|*n 

4xj* + |y|* L I I ij 

I * •'^xj* + |y|* 


.i/a + |y|*e^ 


.) 


The fact that the phase of Z is the sane as the phase of x has no 
particular physical significance, and stem solely from the p»'evious 
choice of 0j and O^. The fact that the power in Z is the sum of the 
powers in X and y stems from the conservation of power constraint 

placed on |K^| and IK 2 I earlier. Thus we conclude that in the complete 
jd>sence of noise, ttie polarization matching method extracts all of the 
available signal power, since 

P7 = Z* = X* + y* = ♦ P 


Consider now the case when the signal to noise ratio (SNR) is small. 
The point M on tt%s Poincare sphere in Figure 1 now marks the maximum 
value of a probability density function on the surface of the sphere. 

In order to increase the accuracy of the measured polarization type, it 
is necessary to average the weighting factors and over time. 

c- V 
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The magnitudes of the factors then become 


|Kil 

IK2I 




m \ 

4^^Fh7F| 


Note that It is not exactly correct to factor out the denominator of 
these expressions, or to regard it as the average power. To do so 
implies interchanging the order of averaging and confuting, which leads 
to a sometdtat different result. 

An average phase difference between the x and y antennas must also 
be calculated. This could be done by simply averaging their phase 
difference over time. A statistically better average is the phase angle 
of the complex correlation coefficient between the two antenna outputs, 
given by 


M 


xy 



(0 < l^xyl - 


Since the denominator of y is real, it does not affect its phase angle. 

xy 

Thus 




* = tan 


Im(xy*) 


Re(xy*) 

This is a ti^ighted average phase angle, where the lighting factors are 
the product of the amplituctes. This may be seen by considering the polar 
coordinate form of x and y. 


Then 


Let X 

y 

xy 


r e 

X 

r e 

y 


jax 

jay 


* * 
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If Me have N tlm samples of x and y» then 

from Mhich the weighted average effect nay be seen. Wien the product 
''‘xi'^yl corresponding phase difference will be weighted 

Iwavlly. That product corresponds to cross power, so In fact Instantaneous 
cross p(ni«r Is the weighting factor. 

SiMmarIzIng these results. In the presence of noise we combine the 
outputs of the X and y antennas according to 

Z = KjX + K^y (A3. 1-3) 

where — — 

'‘i ] ' 


K 


2 


5 


= tan‘^ Jinffi) 

Re(xy*) 


e 




The factors and emphasize the antenna that has the strongest signal, 
and hence the best SNR. Since the time averaged Information must be used 
to combine even the first signals, a time (telay must occur before the 
ccxmblned output Is available. This may be occompllshed by recording the 
signal, or by appropriate data processing. However, It will turn out that 
the final expression for generalized coherence to be derived below may be 
computed contlniwusly In quas1*real time, using "pipeline” techniques similar 
to those used In real-time FFT calculations. 

The above results may be generalized to any two orthogonal antennas. 

The fact that x and y oriented linear antennas were being used only affects 
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the <tef1n1t1on of the polarization angles y and £ on the Poincare sphere. 
Horizontal linear polarization Is chosen by convention to the origin of 
the coordinate system. However, one could with equal validity choose any 
other point (such as left-circular) as the orloln, and then (tefine new 
polarization angles exactly analogous to 6 a*>d r , relating the 
amplitudes and phase angles of the two new antennas. The angles 5 and 
Y are only intermediate variables, and once they are eliminated by 
substituting their values In terms of the observable quantities Into 
equatIcMi (A3. 1-3), their meaning and (tefinitlon become Irrelevant. Thus 
equation (A3. 1-3) Is true for any two orthogonally polarized antennas. For 
example, if left and right hand circularly polarized antennas were used, 

X and y could simply be replaced by R and L. 




Appendix A3.2 

C(rfierency Matrix and Degree of Polarization 


A matrix known as the CoherenQ^ Matrix contains all of the polarization 
Information ^out a signal (Ko, 1962 and Beran et a1., 1964). It Is 


[0 


x: 

K 


XX* 

y y? 


where x and y are the complex output signals from two orthogonal receiving 
antennas. The total signal power Is 

Pj = trace * x5(^ + yy^ 

The (tegree of polarization is 

4 det Ccf ‘ 


■{ 

■I 


1 - 


(trace [c])' 


1 . 4(xx* yy* - xj^* x*vi 

{ xx’^ + yy^ j ^ 


Using the fact that = P^, yy* * P„ and xy* x*y » P^„ we have 

X .j,t xy 


m 


/ 


1 - 


Jt ^ 

4(PA - ) 


X y x: 


(!•« ♦ P,) 


This may also be rearranged to 


m * 


+ P„ ^ 


Pjj + Py Y^^’x ■ ■*“ ^^xy 
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/Ippendix A3. 3 

Effects of Noise on the Measured Degree of Polarization 

The measured degree of polarization Is affected by noise In two 
distinct ways. The first Is that It has statistical fluctuations, and 
therefore must be computed using coherence iwtrix values that are 
averaged over a significant length of time. 

Degree of polarization behaves like a correlation function In that 
if only one time sample Is used, its measured value will always have a 
value of 1. As more and more time samples are used, its measured value 
will approach the true value, from above (Bendat, 1958; Lee, 1970; 

Otnes, et al, 1978). This behavior is illustrated in Figures A3. 3. 2 andA3.3.3 
for various signal-to-nolse ratios. 

The probability density for the degree of polarization is shown in 
Figure A3. 3.1. These curves were computed by adding signals to random noise 

in such a way as to simulate the actual operation of experimentally measuring 

the degree of polarization. The curves apply to a single antenna which is 

matched to the signal polarization. Each curve is a smoothed histogram of 

5000 total samples. The integration time used for these curves is 25 
data points. 

The second effect of noise on the measured degree of polarization is 
that receiver noise cannot be directly distinguished from unpolarized 
signals. A strong partially polarized signal may have th; same measured 
degree of polarization as a weak polarized signal. This effect manifests 
itself as a measured degree of polarization that is always lower than the 
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Figure A3. 3. 2 - Measured Degree of Polarization, when the True Degree 

of Polarization is 1.0. 




Figure A3. 3. 3 - Measured Standard Deviation of Degree of Polarization, 
When the True Degree of Polarization is 1.0. 
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true ctegree of polarization. When the SNR Is large, the measured and 
true values are the same, and when the SNR Is very small, the measured 
value 1$ nearly zero. 

This effect may be quantified as follows: The received signal 
powers In the two antennas are 

Pjj » Pg ( + m cos^ ) 

Pj, • Pj ( + m ) 

The total received signal power Is, after simplification 

P - P + P = P 
r X y s 

To this we add the receiver noise, in the form 



Since the receiver noise power In each receiver is the same, we add half 

of the total to each. After the polarization type has been matched, the 
2 

cos factor is 1, so the total received power In the polarization-matched 
antenna is 

Pr * % ( -4^ + “55^ + m ) 

If we lump together the two noise-like components of the power (receiver 
noise and unpolarized signal), and regard them as being caused by some 
other unpolarized signal, then the original unpolarized power of that 
signal must have been twice that measured (since any antenna receives 
only half the unpolarized power). Thus 

^unpolarlzed * % ^ 1 - m + ) 

1 
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The polarized component 1$ mP^, so we can coelute the measured degree of 
polarization m'» as 

. . polarized power In original signal 
total power in original signal 

Sfsu 

^ <* ■ " *Tnr 

* " < TTTIIS — > 


This effect Is Illustrated In Figure A3. 3.2, as the asymptotic values 
approached as N -*• » . If It were desired to measure 
the true degree of polarization accurately, one could measure the SNR 
separately (by comparing the total measured power at the time when the 
signal Is sought, with a globally averaged total power), and then Invert 

the above relation, 

. / 1 ♦ SNR X 

' — m — > 


This <k>es not become Infinite for small SNR, because m* Is also small In 
that case. In the limit both SNR and m* become zero simultaneously, leaving 
m Indeterminate. This Just reflects the physical reality that you cannot 
measure the polarization of a signal that Is not there. 

Although in principle one could correct the measured degree of 
polarization for both the finite-sample effect and the receiver noise effect, 
it is not necessary to do so for the application at hand. First It should 
be pointed out that the two effects are In opposite directions, so that 
they tend to partially cancel. Second, the measured degree of polarization 
Is always a monotonically Increasing function of the true degree of 
polarization, so that a thresholding operation may always be carried out 
successfully, even In the presence of the two above-mentioned effects. 


- 285 - 


Appendix A3. 4 

Linear Phase Invariance Property of Coherent Power 

To demonstrate the linear phase Invariance property* let the phase 
of the signal z advance linearly by an amcxjnt AO radians per second. Then 
If we explicitly write out the averaging process for coherent power* 


Pc* 


N-1 

'WT' 1-1 


and change the z's to polar coordinates* 

, j( 0+lQA ) 

2^ * A^ e**' ' 


where 0 is the original (arbitrary ) phase angle of z. 
Then 


Pc* 


Pc* 


Pc* 


N-1 


1 


N-1 

1 


A J( 0+ 1^0 )a p-j( O’*- (1+1)A0) 
I ''i ® ''i+l ® 


i=l 

N-1 


j; A A 

N-1 ''i 1+1 ® 


1 

TT 


-jAO 


|N-1 

li=l ^ 


Pc* 


1 

Tpr 


N-1 


i = l 


^N +1 


Since this is independent of AO , see that the linear phase shift has 
no effect on the outcome. Note, however, that any other phase shifts 
(non-linear or random) would cause P to become smaller, thereby 

V 

illustrating its sensitivity to phase coherence. 
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Appendix A3.S 

fiwieralized Coherence Mathetwatical Detiils 


From Equations (3-5), (3-9), and (3-10), we obtain 


GCV ■ l^lU 


1 N-1 

*1+1*1* 


(A3.5-1) 


ffficrc e 

"•- 17 b- •'(P. - Py)* + 

^ y 

Z, • K,», ♦ V, 

and and K 2 were defined earlier In equation (3-5). Sidistltutlng for 
the values , the sunmatlon cerm becomes 


N-1 

Summation * I *^1+1^ ^*^1**1* * *^2*^1*^ 

■V ( |K,|'x,.iX,* ♦ l<2Ki*y,.iX,* ♦ K,K2*x,^jy,* 

+ (A3. 5-2) 


From equation (3-5), we obtain 

N 


|K 


1 ' 




lUi-i /lx^r + |y^l* J 


which may be regarded conceptually as the power In the x antenna, normalized 
to the total power; call It 



1 

If 


N |yj 
I -===—== 
1»1 /}x^|* ♦ iy^l* 


2 


which may be regarded conceptually as the ooii^r In the y antenna, normalized 
to the total power. Call It P^^^. 
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which may be regarded conceptually as the cross power between the x 
and y antennas, normalized to the total power, with a phase lag of 6 . 



which may be regarded conceptually as the ratio of the cross power 

between the x and y antennas, to the total power, with a phase lag 

of 6 . Call It . Then the summation becomes 

xyn 


Summation • ♦ P^e y,^jX,* 

The first term within the summation Is the autocorrelation function 
of the signal In the x antenna, weighted by the normalized pon«r In the 
x antenna. The last term within the summation Is the autocorrelation 
function of the signal In the y antenna, weighted by the normalized 
power In the y antenna. The middle two terms within the summation are 
time-shifted cross-correlation functions between the signals in the 
x and y antennas, weighted by the normalized cross power In the x and 
y antennas. The two middle terms differ In that one has the time lag 
In X, and the other has It In y, so that both are Included In a 
symmetrical way. The two middle terms also differ In their phases. 

Due to the polarization type of the signal, the phase of x Is expected 



1 


!l 
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to lead the phase of y by the angle 6 . This phase difference will on 
the average be cancelled by the e*^^ factor for y» and by the e 
factor for y*. The extent to which this i^se difference cancellation 
(^curs Is a neasure of the phase o>herence of tte signal both In t1«e 


and between the two antennas. 

Thus we see that the terns within the sinmatlon of equation (3-10) 
represent all of the possible tlwe-shi fted correlation functions of tlM 
signal* with eadi correlation function being weighted by Its Inportance 
relative to the particular signal being received. 

The nuUlpllcatIve fact(xr outside of the sunnatlon of equation 
(3-10) takes Into account all of the possible non-tIne-shIfted 
correlation functions of the signal* namely 


P 
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In summary, the generalized coherence expression Involves all the 
possible correlation functions of the signal. The separate effects of 
polarization and time coherence lose their individual Identities, and 
thus generalized coher^ce may be regarded as a fundamental characteristic 
of the signal itself* apart from the way in idiich it is received and 
measured. 

The correlation functions In equation (3-12) are merely generalizations 
of those found In the coherency matrix discussed earlier. This leads to 
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tht suggestion of « Generalized C(^ren^ matrix. 


m 


xx*(t) yy*(T) 
x*y(t) ypTr) 


inhere t Is the time shift of t1« correlation function. 


Uhen T * 0, this 


matrix re<fcices to the coherency nwtrlx, which as stated previously 
coR^letely defines the polarization of a signal. Uhen t ^ 0, this matrix 
coa^letely def1n» the time coherence prt^iertles of a signal. Thus the 
Generalized ^erency matrix cai^)1ete1y defines both the polarization and 


time coherence properties of a signal. 


Generalized Cohcrenct Statistics 


Test cases showed that GCV Is Insensltl^ to the distribution of 
signal pcMier between tl« two feed antennas, so all slmilatlons added 
the signal to only one antenna, and the s1gnal-to-rMlse ratio Is 
thei^fore defined to be that In a polarlzatlon-natcl^ antatna. The 
actual operaticmal SNR Is always Imier than that, since the other 
antenna contributes an equal amount of noise and no additional signal. 

It Is difficult to define a SNR which takes this polarization effect 
into accotmt satisfactorily. If one simply takes the ratio of total 
signal power to total noise power (adding powers arithmetically), one 
obtains a SNR that is one half of that in a single matched antenna. 

On the other hand if one were to add together the output power signals 
from the two antennas, using simple square law detectors, the SNR would 
be /T* less than that in a single matched antenna. The definition used 
here always un<terest1mates the performance of GCV relative to total power 

detection. 

In Figure 3.5 the open circles are the simulation outputs of 25 
seconds integraticm on the sum of the powers of the two feed antennas. The 
solid line is the theoretical locus of these points, representing a SNR 
improvement of five, due the integratiem over 25 data points (i.e., i/25 * 5). 
The plus signs are the simulation outputs of GCV, computed over the same 25 
data points. The dashed line is sinv>ly a smooth curve drawn through these 
points. 
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Figure A3. 6.1 - Computed Probability Density for Time Integration of Total Power 





0.04 



Figure A3. 6. 2 - Computed Probability Density for Generalized Coherence 
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For lOM SNR, additional data points were obtained for Figure 3.5 from 
the simulation probability density curves shown In Figures A3.6.1, A3.6.2, 
and A3.6.3. The points were obtained using a relation: 


OUTPUT SNR 


(HEAN VALUE. AT DESIRED SNR) - (MEAN VALUE. AT ZERO SNR) 

STANDARD DEVStIWT'AT ZERO Sfc 


The power points are shown as solid circles, the 6CV points as letter X's, and 
the square of GCV as solid triangles. Since the simulations for probability 
density Involved a much greater manber of total data points (5000) than did 
the simple comparison simulations (25), the experimental error Is much smaller 
In the former case. This explains why the points obtained from the power 
probability density curves fall exactly on the theoretical curve, whereas the 
others exhibit a scatter. 


Appendix A3. 7 

The Possibility of Truly Coherent Detection 


Generalized coherence does not require knowledge of the exact signal 

frequency within an HCSA output bln, because of Its phase Invariance 

property. The one remaining piece of information that Is not used by 

generalized coherence Is the phase angle of the degree of coherence 

(I.e., the phase angle of the complex autocorrelation function). This 

phase angle Is In fact the average phase change per data point (I.e., the 

linear phase term). The offset frequency from the center of the bln Is 

(since f = ), In units of blnwidths. In principle, 

one could compute after taking the polarization Information Into 

account, and then use It to apply a phase correction to the signal that 

compensates for Its frequency within the MCSA bin. The procedure could 

be iterated If necessary, by computing a new after the first 

corrections had been applied, and repeating the process. This would 

presumably permit a truly coherent detector to be used, in place of 

that portion of the generalized coherence that calculates l^z-^,z-*| . 

N 

Instead we would compute I ^ ’ where 

zz 

serves as a measure of the coherence of the signal. 
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Appendix A3.8 
Theory of ANOVA 


Analysis of variance Is deeply rooted In the history of 
statistical methodology (Cochrane and Cox» 1962, and Scheffe, 1950) and 
It Is a prime tool for the handling of data obtained frxm deslgr^d 
experiments. As its name implies* it involves the systematic parti- 
tioning of the total variation in specially obtained data sets. The 
partitioning Is done In order to assess the contribution each designated 
source of variability has on the total variance. 

The form of analysis of variance that is appropriate for the SETI 
data matrix is the two-way ANOVA with cross classification. The columns 
of the data matrix represent the FREQUENCY in Hz* while the rows represent 
the TIME in seconds. Cross classification is obtained by having at least 
two (Aservations per matrix cell. This "pseudo" set of replicated obser- 
vations (Sec. 3.5, ANOVA, Performance) is necessary for the calculation of 
an interaction effect, and an interaction effect is necessary for the detection 
of certain types of signal. So, to consider the most general class of signal 
types, an r X c matrix of time vs. frequency, with n observations per cell 
will be the format for the analysis of variance done on the data. 

Let represent the observed response of the replicate to the 
i^^ level of the row treatments and the level of the colixnn treatments. 
Stated in a more physical fashion, Y^^j^ represents the calculated (observed) 
polarized power of the k^^ replicate at the i^^ time period and the 
frequency . 
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Each observed response, can be represented by the following 


model : 


»)jk ■ U ♦ P, + Yj ♦ (py),j * Pijk 


(A3.8-1) 


where i = l,...,r and r is the number of levels of the first factor (time); 

j • l,..,,c and c is the nuiriber of levels of the second factor 
(frequency) ; 

k » l,...,n and n is the number of replications for each row x column 
combination; 

while u is the overall mean response, which, for this problem, is the 
average power , represents the effect of the i level of the first factor 
averaged over c levels of the second factor; (the i^^ time interval response 
values averaged over all frequencies). 

th 

Yj represents the effect of the j level of the second factor 

J 

averaged over the r levels of the first factor (the frequency averaged 
over time). 

(py)jj represents the interaction between the i^*’ level of the first 

* %) 


factor and the level of the second factor (the interaction between 
the i^^ time (second) and the frequency). 

^ijk *"®P'"®3ents the deviation of Y^jj^ from the population mean 
response for the (ij)^^ population. 

To demonstrate how the total sum of squares can be partitioned 
into the sum of squares attributed to row and colwim and interaction 
effects, the following mathematical discussion is presented. 
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Consider the deviation obtained by subtracting the mean v In (A3.8-1) 
from and substituting the following point estimates for the 
parameters: 


0 " 





Ijk ■ ’'U- 


The following expression for the partition of the total deviation about 
the mean: 


Is obtained. 


(T,„ -T...)*(7.j. -T...) 

-T,.. -T.j. (A3.8-2) 


Three tests of hypothesis can be realized by looking at the computed 
F>rat1os for rows, columns and Interactions. For a given a*level (a 
prespecified risk) the critical value can be found for the appropriate F- 
dlstrlbutlon. The null hypothesis of no-signal present Is rejected If the 
confuted F-rat1o for rows, or colimms or Interactions exceeds Its critical 
F-value. 

The choice of a, which Is the risk al1ov«d in rejecting the null 
hypothesis, H^, even though It be true, can be made as small as desired 


* The <tot {•) Indicates summation over all valws of the Index replaced 
by the dot; I.e., 

V,„ » - - ][ g ? J S Y... 
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(Type I error) (towever, a very small a will result In a large Type II 
error* called 0. 0 Is the probability of accepting the null hypothesis 
when It Is false. The relationship between Type I and Type II errors Is 
Indicated In Table A3. 8*2. 


HYPOTHESIS 


DECISION 


ACCEPT REJECT 


NO 

ERROR 

a 

TYPE I 
ERROR 

0 


TYPE II 

NO 

ERiXR 

ERROR 


Table A3. 8. 2 - Type I and Type II Errors 


For the detection problem, a represents the probability of affirming 
the presence of a signal when none Is present; I.e., a FALSE ALARM. 0 
represents the probability of MISSING A SIGNAL when one Is present. A 
tra<te off between these tm types of errors Is always present In any 
statistical hypothesis testing. A decrease In one type of error results 
In an Increase in the other type of error, for a fixed number of observations. 
Usually, a is preset at some tolerable level, and the value of 0 is 
determined for various alternative hypotheses. The complement of 0, namely 
(I - 0} Is called the power of the test. Plots have been determined 
(Chemical Rubber Co., 1968) showing the power of ANOVA for given a against 
various alternative hypotheses. The alternative hypotheses state that a signal 
Is present. (%>v1ously, the stron^r the sl^al the higher the value of (1 > 0), 
and thus the greater tiw 11kel1N>od of Its being detected. 
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Experimental Conditions 


Appendix A3. 9 
AWVA Performance 


1. All experiments were carried out at nine signal -to-nolse ratio (SNR) 
levels, starting with 9 db and decreasing to -12 db In decrements of 
3 db. The SNR is represented In the figures by the vertical axis. 
The horizontal axis represents the row, col inn, or Interaction 
variance ratio depending upon the type of signal being analyzed. 

2. Four types of signals were consistently analyzed. They are: 

(a) non-drifting narrow-band signals, 

(b) broad band pulsed signals, 

(c) slowly drifting narrow-band signals, and 

(d) drifting narrow band signals. 

These four types represenf orobable forms of a meaningful ETI 
signal, and are diverse enough to trigger one (or more) of each 
type of hypothesis provided for by ANOVA. 

3. Three factors that affect the quality of detection of signals by 
ANOVA were examined for each of the four signal types of all 
specified SNR. These factors are: 

(a) width of signal 

(b) frequency averaging 

(c) number of bits used to represent the signal power (quantization 
level ) 

4. The data for all combinations of the three above-mentioned factors 
were each analyzed six times with six different noise fields, and 
the averages of the six results are shown in Figures A3. 9. to A3.9. 
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Discussion of VaHince Ratio vs* F«Ratio 


The use of the expression "variance ratio" Instead of the frail lar 
F-ratIo Is In recognition of the fact that tlw distribution of the noise 
power data Is exponentially distributed rather than Russian* hence the 
ratio of the squares of these ran<kMi variables follows srae dIstrlbutiM 
that Is not yet known. Until the tl^retlcal distribution Is established* 
critical values appropriate to the F*d1str1but1on for various a>1eve1s can 
serve as conservative thresholds for the hypothesis tests. Empirical 
evidence of the distribution of the variance ratio for the noise data, 
seems to support this assumption. 

Description of Figures 

Figure A3.9.1 depicts the representation of four types of signals In 
the noise field matrix. 

Figures A3.9.2(a-d) (temonstrate the sensitivity of ANOVA as a function 
of signal width. Widths of 1* 2 and 4 Hz bandwidth are superimposed upon the 
noise field according to the signal type pattern. As expected* an Increase 
In signal width In general Improves the detectability as Is evidenced 
by the larger values of the variance ratio. In particular. Figure A3. 9. 2(a) 
cteplcts the response of ANOVA (as represented by the colunw variance ratio) 
to a non-drifting narrow-band signal. If a variance ratio of 1.3 represents 
a critical value, a one-Hertz signal would be (tetectable at -l.S db* a 


* The derivation of the distribution of the variance ratio for exponentially 
distributed random variables is being Investigated (Sogllero, 1980). 
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two-Hertz signal at -3 db, while the four-Hertz bandwidth signal would be 
detected it -6 db. It is understood that the signal is present the full 
20 seconds. 

Figure A3.9.2(b) shows a plot of the row variance ratio for a broadband 
pulsed signal. The signal has a bandwidth of 40 Hz and lasts for 1, 2, or 
4 seconds. Here the effect of signal width is even more striking than in 
Figure A3.9.2(a). If a variance ratio of 1.5 is considered as the threshold, a 
1-second pulse would be detectable at abort -2 db, a 2-second pulse at about 
-4 db, and a 4-second pulse at about 5.5 db. 

Figure A3.9.2(c) shows the plot of the column variance ratio for a 
slowly drifting narrowband signal. While the general trend remains as 
in A3.9.2(a and b) the increase in detectability is not as great due to tJie 

splitting of the variation in the data betv^en the co 1 umn and interaction 
sum of squares. Nevertheless, the improvement with signal width is 
sizeable. 

Figure A3. 9. 2(d) shows the plot of the interaction variance ratio, 
the term most logically suited for the drifting narrowband signal. 

Unfortunately, ANOVA fails to pick up the 1-Hz signal at any SNR. If 
the matrix were square the results would be better. For signals of 
2-Hz and 4-Hz bandwidth the results are better. However, for this 
configuration of matrix size and type of signal, ANOVA does not do well. 

Figures A3. 9. 3(a) to A3. 9. 3(d) show the effect of frequency averaging 

upon the variance ratio. In Figure A3. 9. 3(a), a non-drifting narrowband 
signal shows consistent improvement in its detectability as a result of 

increased frequency averaging. The improvement occurs as a result of the 
smaller residual variance brought about by the averaging. In general. 
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the residual variance Is reduced by a factor of 1/N, where N Is the 
number of frequencies averaged. For larger time x frequency matrices* 
an even greater Improvement In detection might be expected. 

A broadband pulsed signal displays a more ccMapllcated response as 
reflected In the row variance ratios plotted In Figure A3,9.3(b). While 
the 8 Hz averaging is highly beneficial for SNR > 0 db* a lessening of 
its effectiveness is (tetected at SNR < 0. In fact at -12 db* although 
mostly noise effects dominate, the 8 Hz averaging fares worse than e1t1»r 
2 or 4 Hz averaging. 

Figure A3.9.3(c) depicts the effect of frequency averaging upon a 
slowly drifting narrowbaml signal. Here* as In A3.9.3(a), the effect of 
averaging is highly beneficial* with 8 Hz averaging being consistently 
better than the other averaging levels. 

Figure A3.9.3(d) shows how the drifting narrowband signal Is improved 
by frequency averaging. 

As was shown in Figure A3. 9. 2(d)* the rapidly drifting signal Is most 
difficult to detect for the matrix size spec1f1ed--a square matrix being 
optimal . 

Figure A3.9.4(a-d) shows the effect of bit quantization In the 
generation of the noise field. Quantization Is a necessary outgrowth of 
the discretization of the data. The level of quantization refers to 
the nuflber of bits that are available for the algorithm processing. 

For the simulation studies 2, 4, 8 and 10 bits are used. 

The use of 2 bits implies that the observations of the noise field 
are represented by 4 numbers only. This represents the number of 
intervals allowed in the partitioning of the exponential distribution. 
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It is apparent that some information is lost at this level of quantization. 

In (Chen» 19%) the loss of such information as a functi(m of bit size is 
explored. Different scale parameters for adjusting the variance (of the 
exponential distribution) to the word length in order to minimize the loss of 
information, are presented. These values are used to adjust the quantization 
levels of the clipped exponential distribution. 

Figure A3. 9. 4 (a-d) reflects the results of quantizing the data of 
exponentially distributed noise pulse signal to bit values of 2, 4, 8 and 10. 
The degradation of the detectability of the signal at low quantizaticm levels 
is apparent. A gradual improvement results as the bit size increases. For 
bit size of 10 the curves are almost identical to the non-quantized values. 
Figure A3. 9. 4 (d) shows that quantization does not improve this signal type 
possibility of detection. For adequate performance of the ANOVA algorithm, 
eight bits are recommended. 
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signal Width vs. Column Variance Ratio 


Vertical Slope: Non-Orlfting Narrowband Signal 



Figure A3. 9. 2(a) - ANOVA's Sensitivity to a Non-Drifting Narrowband Signal 

as a Function of Signal Banchrldth 
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Signal-to-Noise Ratio 


Signal Width vs. Rom Variance Ratio 
Horizontal Slc^ie: Broadband Pulsed Signal 



Figure A3. 9. 2(b) - ANOVA's Sensitivity to a Broa(fcand Pulsed Signal 

as a Function of Time Duration 
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Signal-to-Noise Ratio, dB 


Slowly Drifting Narrowband Signal 

One Hertz Width 
O Two Hertz Width 
□ Four Hertz Width 

Drift Rate: 0.2 Hz/sec 



Figure A3. 9. 2(c) - ANOVA's Sensitivity to a Slowly Drifting Signal 

as a Function of Signal Bandwidth 
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Drifting Narrowband Sional 


A One Hertz Width 
O TWO Hertz Width 
pi Four Hertz Width 



Figure A3, 9. 2(d) - ANOVA's Sensitivity to a Drifting Narrowband 

Signal as a Function of a Signal Bandwidth 
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Signal-to-Noise Ratio, dB 


Non-Drifting Narrowband Signal 
^ Two Hertz Average 
O Four Hertz Average 
Q Eight Hertz Average 



Figure A3. 9. 3(a) - ANOVA's Sensitivity to a Non-Orfiting Narrowband 

Signal as a Function of Frequency Averaging 
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Signal-to-Noise Ratio, dB 


Slowly Drifting Narrowband Signal 

^ Two Hertz Average 
O Four Hertz Average 
□ Eight Hertz Average 


Drift Rate; 0.2 Hz/sec 



Figure A3. 9. 3(c) - ANOVA's Sensitivity to a Slowly Drifting Narrowband 

Signal as a Function of Frequency Averaging 
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Broadband Pulsed Signal 

V Ten Bit Quantization 
O Eight Bit Quantization 
□ Four Bit Quantization 
/2 ^Two Bit Quantization 




Rom Variance Ratio 


Figure A3. 9. 4(b)- ANOVA’s Sensitivity to a Broadband Pulsed Signal as 

a Function of Quantlzatlw 


^ Afotc NOT 
pf^CS>tNG PAG& 



Signal -to- Noise Ratio, dB 


Slowly Drifting Narrowband Signal 


^Ten Bit Quantization 


O Eight Sit Quantization 



Figure A3. 9. 4(c) - AMQVA's Sensitivity to a Slowly Drifting Signal as a 

Function of Quantization 




Figure A3. 9. 4(d) - ANOVA's Sensitivity to a Drfiting Signal as a 

Function of Quantization 

I 

I 
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I do loathe explanations. 

--Sir James Matthew Barrie, 
My Lady Nicotine 


Appendix A3.10 

Carrier Wave Detector Weighting Factors and Approximate Detection Statistics 


Amplitude Weighting and Signal Distribution Models 

The weighting procedure described below is a way of applying 
techniques of the Radon Transform (Radon, 1917) to the signal extraction 
problem at hand. Since it allows a signal to be picked up in a continuous, 
lin^^ar fashion, it is in some respects a generalized digital version of the 
analog, delay line video detector described in the CYCLOPS Report (1973, 
pp, 132-133). In general there are three potential signal distribution 
modes that could be used to determine weighting factors: the delta-function 

nx)del, which uses the value in the frequency bin whose center falls within 
the ray; the uniform distribution model, which uses weighting factors 
proportional to the area of each frequency bin contained within the ray; 
and the sine distribution model which uses weighting factors proportional to 
the area under the sine curve. 


- 320 - 



The NCSA Impresses a sine weighting functicw on the Incoming data; there- 
fore, the delta- function model 1$ not exactly appropriate for this situation. 
The most obvious choice Is to use exactly the same model as the NCSA; 
however, the following analysis of the difference between the two models 
seems to Indicate that little Is gained by ^Ing to a sine model, and a 
uniform model Is much easier to Implement. 

Consider a single time- frequency “picture element*' (pixel) of the 
array with unit length and unit width. The percental of the pixel 
value (pv) contributed to the total ray sim depends upon the way the 
pv is distributed throughout the pixel and the way In which the pixel 
is overlapped by the ray. For the uniform mo<tel the percentage Is 
just that portion of area Inside the ray, which in Figure A3. 10.1 is just 
1 • X » x; the contribution to the ray sixn is x times pv. 

Consider the case in which the weighting is by the sine function 
which is, by definition (Bracewell, 1978), 


sine X 


simrx 

TTX 


And, suppose the pv is distributed over the pixel so that the zero crossings 

occur at zero power, corresponding to the comers of the pixel, as Illustrated 
in Figure A3. 10.2. Since we wish to include the central Ic^e in the pixel, our 


purposes are served best by considering sine (2x - 1). In the sine model the 
contribution to the ray sum made by a given pixel is given by f(x) times pv 
where f(x) is x 

(A3.10-1) 


f(x) 


jj^s1nc(2t - 1) dt 


Thus, f(x) is an appropriately normalized fraction of area under the sine 
function. 
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I 



Figure A3. 10.1- In the uniform model the contribution to the ray 
sum is just that portion of the pixel which lies 
inside the ray multiplied by the pixel value. The 
variable x represents the fraction of pixel over- 
lapped by the ray. 
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For purposes of calculation it will be useful to simplify (A3. 10-1) 
First, observe the defining relation for the Sine Integral function 
(Abramowitz and Stegan, 1964). 


Si(x) = 


J sin t 
n t 


By (A3. 10-2) it is straightforward to obtain 


A 

ii(7Tx) » IT \ sine t dt, 


(A3. 10-2) 


(A3. 10-3) 


$inc(2t - 1) dt = - 57 -Si(ti) + | sine t dt (A3. 10-4) 


For 0 < X < 1/2 the integral on the right side of (A3. 10-4) is negative 
it is useful to take that into consideration explicitly; thus 


sinc(2t - 1) dt = Si(Tr) — Si(7T - 2nx), 0 £ x 1/2 

2 Si(ii Si(27TX - 7r), 1/2 < X < 1 

(A3. 10-5) 

We now observe that the denominator in (A3. 10-1) is Si(7t) and rev/rite 
f(x) in terms of the tabulated Sine Integral function. 


f(x) = 


I ~ 2 s I t WT ® ^ 

? ^g l^ TT " ) ' Si(2iTX - tt), 1/2 £ X £ 1, (A3. 10-6) 

(A3. 10-7) 


where Si(7r) = 1.8519370. (A3. 10-7) 

The values for f(x) were computed for 0 x 1 and were used to 

compute the per cent of the pv included in the ray sum which is given by 
f(x) times 100. In Figure A3. 10. 3 the results are plotted as a function of the 
fraction of overlap x. The straight line represents the percent of the 
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pixel value included in the ray sum for the uniform model and Is included 
for comparison with the graph for the sine model. 


A moment of thought reveals that the results of Figure A3. 10.3 hold 
even if the ray is not parallel to the pixel, provided x is normalized to 
represent the fraction of overlap for the more general case. 

In the case where the pv is distributed over the pixel as indicated 
in Figure A3, 10. 4 so that the zero crossings occur at half-power, corresponding 
to the midpoints of adjacent pixels, the function f(x) is modified to 

,( X) . . r . slpctt - V2)dt 

^jSinc(t - l/2)dt 

In terms of the Sine Integral function. 


f(x) 



2st[v/T)’ 

2Si(TT/2) 


0 < X < 1/2 
1/2 < X < 1 


(A3. 10-9) 


where 


Si(7T/2) = 1.3707622 (A3. 10-10) 

This "second" sine model is shewn in Figure A3. 10. 5 (dashed line) where it can 

be compared with the uniform model and the previous sine model. Observe 
that for the second sine model there is very little deviation from the 
uniform. 

The weighting function used by the MCSA is most closely matched by the 
second sine model and therefore, seems to be the most logical choice. However, 
since the difference between it and the uniform model is so slight it is 
recommended from the standpoint of the in^lementation that the weighting 
factors for the ray be based on the uniform model. 
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Figure A3. 10, 5 - Comparison Among Uniform (straight solid line), Second 
Sine Model (dashed line), and First Sine Model (curved 
solid line) 




The special way In which the Radon transform (Radon, 1917; Gel'fard, 

1966) Is applied to the GCV numbers or power spectrum for signal enhancement 
and linear pattern selection has been shown (Deans, 1979) to be equivalent 
to a transform known as the Hough transform to people In the area of 
pattern analysis and Image processing. This transform grew out of early 
attenq)ts to Implement automatic and rapid methods for analyzing particle 
tracks In bubble chamber photographs (Hough and Powell, 1961; Hough, 1962; 
Franck, Hough, and Powell, 1963). The basic 1<tea Is to designate the 
frequency-time space by the rectangle xy coordinate system, see Figure A3. 10.6. 
Col linear points In this space may be characterized by the line 

p * xcos 6 ysin 6 

This approach Is used to map col linear points In the xy plane to single 
cell regions in a space with rectangular coordinates (e,p) which we call 
parameter space (see Figure A3. 10.6(b)) . 

Calculation of Ray Amplitude Factors 

Usually, the discrete Implementation of R.-'don transform Is performed 
using line integrals or ray sums, where the rays are equal In width to the 
width of the pixel. In^)lementat1on constraints here dictate that the ray 
widths vary as a function of angle. (The horizontal distance Is kept 
constant and equal to one unit as Illustrated In Figure 3.9.) A method 

Is presented for computing the ray factors which does not require that the 
ray width be the same as the pixel width. A sample calculation is given to 
Illustrate the method for finding the ray factors associated with a given 
cell or pixel. 
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near Points in the xy Plane to a Single Cell 
pace. The Hough Transform. 
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F(x,y) » x(i+l»j+l) 


Given a unit cell (or pixel) at location (UltJ-i’l) In the grid 
Indicated In Figure A3.10.7 It Is desired to find the line factors and ray 
factors associated with the cell. The line factors may be found directly 
frcxn the Radon transform of the characteristic function 

whenever (x,y) e 
elsewhere 

In Figure A3. 10. 7 the 11f« factor Is just the length of the line segment AB 
(shown In Inset). The ray factor Is the area of the shaded region, (hice 
the line factors are determined It Is easy to calculate the ray factors 
by adding areas, see Figure A3. 10.8. 

The line factors for are found from the Radon transform 


C 


formula 




// x(1'*’l.j''’l) ^ ( 01 , ■ XCOS0 - ys1n0)dxdy 
D 


where the Integral over the domain 0 reduces to an integral over 

We now give a prescription for finding the various factors. Details 
of derivation may be found elsewhere (Deans, 1979). 

(1 ) Given 0 and pj^: 

Calculate p * ■ P|^ - IcosO - js1n0 

If p Is not In the region 0 < p < sino + cos0 the line does 

not pass through the cell and the line factor Is zero. 

(2) Use Table A3. 10.1 to calculate the line factors, fj^. 

(3) Use Table A3. 10.1 to calculate the ray factors, r^. 

(4) Once the ray factors are calculated there are various cases 

to consider In order to find the contribution made by the cell 
under consideration to the total ray sum for various rays. 
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Unit Cell. The Length of Line AB through the Cell is 
le Line Factor, and the Shaded Region is the Ray Factor. 


] 

1 

1 
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I 


Region for p 

line factor fj^ 

ray factor rj^ 

0 < p < sin 0 

P 

pV 

sine cose 

sin 20 

sine < p < COS0 

1 

2psine - sin^e 

cose 

sin 2e 

COS0 ^ p < sine + COS0 

sine + cose - p 

2p(sin0 + COS0) - (1 + p*) 

sine cos 6 

sin 20 


Table A3. 10.1 - Line and Ray Factors 
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For the ray width equal to unity (same as cell width) there are only two 
cases (a) and (b). Illustrated In Figure A3. 10.9. 

(a) Cell 1,j contributes to ray sum and The contribution 

to S|^ Is just the ray factor for the line The 
contribution to ^k+1 Is one minus the ray factor, 1 - 

(b) In this case a contribution Is made to S|^, and S|^^ 2 * 

The contribution to Sj^ Is rj^. The contribution to Is 
*^k+l ’ contribution to is 1 - 

A sample calculation Is shown for the Indicated cell In Figure A3. 10. 10. 
Calculation for C(21,7): 

Step 1: C(21,7) contributes to rays 22 and 23. 0 = 14.04° 

Pk “ P 22 " 22COS0 

p = P|^ - 2Ocos0 - 6s1n0 = 21.343 - 20.858 
p = 0.485 

Step 2 : Contribution to ray 22: 


"22 


2ps1n0 - sin^0 
s1?F0 


Contribution to ray 23: 


^ * 1 * ^22 ” *^25 


.375 


Phase Factors 
Overview 

There are two separate phase corrections that must be applied in 
order to properly combine adjacent time -frequency cells into the generalized 
coherence value along a given line. One of them is due to the phase of the 
signal itself, and the other is caused by phase shifts that occur In the 
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Figure 


NCSA. The two corrections are Independent, and can be made separately. 


Signal “Caused Corrections 

From equation (A. 28) we can write the polarization-matched signal 
as 




‘1 "1 


where 


f = frequency drift rate (Hz/ sec) 
f^ = Initial frequency (when 1 « 0) 

0^ = Initial (arbitrary) phase angle 

1 = Integer time Index; 1 = 0 initially; i increases in steps 
of one second. 

Note that the phase is a quadratic function of time. 

The generalized coherence factor involving z^ is 

N-1 


GCV 


i=o 


Using 




we obtain 


GCV 


GCV 


GCV 


N-1 . 

Z A. A. 2irf^(i + l) + O.^j) - (7tfi‘+ airf^l + 0. ) 

i=0 ’ 


i=0 


'i"i+r 


N-1 




i=0 


1 » n . . a 

y 1 + 1 


Since the magnitude of the phase term outside the summation is one, we have 

N-1 . .. 

GCV ^ Z A/, e'^^^'^i + r'V 

i=0 ’ ’ ^ 
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or 


GCV - 


N-1 

I 

i-0 


, , * .'•■jZirfI 

*i+ri * 


This Is the value that would be measured If no phase correction were made. 
In order to cancel the phase variation, we actually confute 


6CV ^ 


N-1 

I 

1-0 




(A3. 10-11) 

The linear phase Invariance property of generalized coherence thus 
requires that only a linear phase correction be made to a signal with a 
quadratic phase variation. 


MCSA- Caused Phase Corrections 

The phase shift Introduced by the MCSA Is derived in Appendix A. 

For a drifting signal, the phase shift depends on the path of the signal 
through the frequency bln during the Integration period. A good 
approximation to the phase shift is irZ^f^ radians, where Af^ Is the 
difference frequency between the signal and the center of the bln (in 
units of blnwidths), measured halfway through the Integration period. 

Table 3.5 compares the approximate and exact phase shift, for linear 
drift rates up to 1/2 Hz/sec, which is the maximum being considered for 
this system. 

The approximate method will be used for the remainder of this 
discussion, because It Is easier to visualize. The exact method assimies 
that the signal has Infinitesimally small bandwidth, whereas the approxi- 
mate method is somewhat more general. This con^)ar1son is analogous to the 
case of the amplitude weighting factors, where sine weighting is exactly 
correct for infinitesimally narrowband signals, but uniform weighting 
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gives nearly the same result, and Is applicable to wider bandwidth signals 
as well. Since the phase factors only need be computed once and then 
hardwired Into the system, the exact method of Appendix A could still be 
used If desired, following the general plan to be described here. 


Drift Range Exact method phase shift 
0<d<0.5 41?4 

0 < d < 0.2 16?9 

0 < d < 0.1 8?1 


Approximate method phase shift 
45° 


18 ^ 


Table A3. 10. 2 - Comparison of Exact and Approximate MCSA 
Phase Shift Conn^utatlons (See Appendix A) 


Let us first consider phase changes which occur as a function of 
time (I.e., from one Integration Interval to the next). Changes which 
occur as a function of frequency between adjacent frequency bins in the 
same integration interval will be considered subsequently. In Figure A3.10.il. 
the solid lines are the edges of the Radon band, the dashed line Is its 
center, and the dotted lines mark the halfway time point of two successive 
integration periods. The boxes represent time -frequency bins. Let us 
first assume that the signal is drifting exactly along the centerline of 
the radon band, at a drift rate of f. Using the approximate phase shift 
method discussed above, we see that the MCSA will Introduce phase shifts 
of tt(C-B) in bin 1 and tt(F-E) in bin 2. Since F=C+f and E=B, the shift 
In bln 2 Is also Tr{C-B+f). Thus the shift In bln 2 differs from that of 
bin 1 only by Tif. It may also be seen that the same difference would 
occur between bin 3 and bin 2, and between any two successive bins in a 
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Figure A3.10.il- MCSA Phase Shifts Between Successive Integration Intervals 
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vertical row. Suppose now that the signal was not at the center of the 

radon band, but at its left edge. The phase shift introduced by the 

NCSA is tt(A-B) in bin 1 and it(D-E) in bin 2. Since 0 ■ A+f and E “ B, 

• * 
this is also Ti(A-B+f), so again bin 1 and bin 2 differ only by nf. 

Thus it can be seen that the phase shift between adjacent integration 

periods does not depend on the position of the signal within the radon 

band. Furthermore the phase change from one integration period to the 

next is constant, which means the phase changes linearly with time. 

Since generalized coherence is invariant under such phase changes, this 

shift has no effect and can be ignored. 

Considering now the MCSA phase shifts between adjacent frequency 
bins in the same Integration interval. In Figure A3. 10. 12 the lines have 
the same meanings as in Figure A3.10.il. Points A, B, and C are the midpoints 
of bins 1, 2, and 3 respectively. Point D is the intersection of the 
center of the radon band and the center time line. The radon transform 
requires a weighted average of all bins lying within the radon band. 

The amplitudes of the weighting factor are the respective areas within 
the band, as discussed elsewhere. The relative phases must also be 
taken into account for proper combining of the bins. 

The position of the signal within the band makes no difference as 
was shown above, but some point must be chosen as a phase origin, so 
that all the bins can be adjusted to have the same phase. The logical 
choice for this is point D. The HCSA phase shift is tt(A-D) for bin 1, 
tt(B-D) for bin 2 and 7t(C-0) for bin 3. Thus the necessary correction 
factor is the negative of these, and would be applied in the form 


^i 




(A3. 10-12) 


- 342 - 



1 


Figure A3. 10. 12- MCSA Phase Shifts Between Adjacent 

Frequency Bins in the Same Integration 
Interval 
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where the c's are the amplitude weighting factors. It Is also useful to 
note that (D-A) - (D-B) = B-A » 1 and (D-C) - (D-B) - B-C - -1 (since the 
units are b1n>w1dths). Thus can also be written as 


2, . - Cjif) 


(A3. 10-13) 


Either method (A3. 10-12) or (A3. 10-13) mey be used to apply the necessary 
phase correction factors. 

The two NCSA phase corrections derived above may be combined and 
applied simultaneously. The composite result is 




jn(D-A-2fi) ^ ^ , Jir(D-B-2f1) . ^ , Jir(D-C-2f1) 

' ^ * Vc® (A3. 10- 14) 


for a given drift rate and starting point, the phase factors in equation 
(A3. 10-14) may all be computed in advance, and stored in a lookup tatole 
for real time computation purposes. 


Approximate Effect of Drift Rate Quantization 

Since only a finite number of drift rates are used in the CWD it is 
important to know what happens if a signal drifts at rate other than one of 
those chosen for analysis. The worst case is when the signal drift rate is 
halfway between two to those chosen. That case will now be examined. 

The CWD rate quantization chosen is such that the drift increment is 
the reciprocal of the total integration length (e.g., for 20 seconds 
integration, the drift increment is 1/20 Hz). 

A typical worst case is thus a signal with a drift rate of 1/40 Hz. 

Over the total integration period, it will drift half-way across a radon band. 
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Then the worst way In which It can do so Is to drift from the center to 
one edge. Since the combining operation among bins at the same time does 
not depend on the signal frequency, the combining will still be correct. 
However, the phase correction of the sum to the center of the radon band 
will have an error varying linearly from 0 at the beginning to ir/2 at the 
end of the integration period. Similarly the phase correction factor due 
to the signal itself will be in error ranging linearly from 0 to it, over 
the same period, resulting in a total error of 3 tt/ 2. The net result is 
that the terms of GCV do not all add up exactly in phase, and 

partial cancellation occurs. The ain>litude of the observed signal also 
changes as it drifts across the HCSA sine response function. The 
situation is exactly analogous to that given in Table A. 2, if it had an 
entry for 0 £ d £ 0.75. Plotting and interpolating Table A. 2 gives an 
amplitude value of 0.62 for this case. Thus the worst case signal loss 
is 2.1 db. 

For the same drift rate, the best case would be for the signal to 
drift from -1/4 bin, to +1/4 bin, relative to the center. Then the 
amplitude factor would be about .85, making the average be about 0.74, 
or 1.3 db loss. If we now calculate the average over all drift rates, 
we can estimate the average loss to be one half of this or about 0.7 db. 

In summary, by choosing the drift rate increment to be the 
reciprocal of the integration time, we incur an average decrease in 
sensitivity of 0.7 db, and worst case loss of 2.1 db. This loss is not 
substantial, and could be decreased by increasing the number of drift 
rates analyzed. 
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Approximate Detection Statistics 

In this section, approximate detection statistics for the CUD will 
be derived. The basis for this derivation Is the body of pre-existing 
knowledge roncerning the Radon transform. The derivation Is approximate 
in the sense that It does not explicitly Include the effects of 
Generalized Coherence. Based on the results of Section 3.3 above, 
these results may be considered as underestimates of the true performance 
of the CUD. 

The particular angle-radius parametrization (which comes out of 
Radon transform theory naturally) utilized by the CUD was not developed 
for the Hough transform until (Duda and Hart, 1972) pointed out the 
advantages in 1972. Since this work many papers on the Hough transform 
and more general Hough-type transforms have appeared (Shapiro, 1978; 
Sklansky, 1978; Shapiro and lannino, 1979). (Shapiro, 1978) and 
(Shapiro and lannino, 1979) have studied the Hough transform in the 
presence of noise with particular attention given to errors In parameter 
space which arise from errors in feature space. Noi*"e analysis of the 
MCSA output is discussed elsewhere in this report. The Important point 
here is that the transform which is being used has been applied to arrays 
of data which are in many respects similar to the properties of the 6CV 
and power spectrum data. The noise properties of the transform have been 
studied and it may be comforting to observe that the transform Is In some 
sense optimal in that it behaves as a matched filter (Sklansky, 1978). 
Noise analyses from a slightly different viewpoint(Hanson, 1979) 
demonstrate that once the projections (GCV numbers or sum over power 
spectra along rays of width a) are obtained for a fixed angle G, the 
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function Pq(x) which contains both signal and noise Information may be 
studied by standard techniques of communication theory (Middleton 1960i 
Middleton 1965; Whalen 1971). It follows that the optlmwn SNR Is given by 

^ o //T 

for a projection ray of width a and length L, for white noise with mean 
zero and variance o*. where the Integral is over the ray width. Observe 
that for signal S in k pixels of unit width this reduces to kS/o^ . For 
L » 1 and k « 1 we obtain S/o for a single pixel, and for L « k we obtain 
yirs/o. 

Assume that L is the integration period t^ of the OiO, Then, according 

to the above SNR formula the optimum signal power extracted Is given by 
k S 

^ yC This expression can be used to do some elementary analysis to 
investigate the set of conditions under which a signal can be detected If 
the angle of the drifting signal is not matched exactly by a projection 
ray Pq of the detector. The probability of detecting a signal can be 
analyzed with respect to the fraction of signal (k/t) picked up by the 
detector at various false alarm rates (a levels) and for different signal 
to noise ratios (S/o). In addition, the number of seconds needed for 
the integration period t in order to achieve a 99% probability of detection 
for different a levels and S/o is considered. 

The results given below are based on the following: The noise Is 

assumed to be a Gaussian (normally distributed) variable with mean 
power p/Hz and standard deviation o * p. (Note that we do not assume 
normal ization with zero mean at this stage.) The siinal Is assumed to be 
continuously drifting, with a constant mean power S/Hz and, if present. Is 
presumed to exist for the entire Integration period (t seconds). It Is 
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also assumed that the average power x (or 6CV) in an arbitrary ray 
(traversing a sufficient nuntf)er of data points) is nearly normally 
distributed with mean y and variance afta where t» the integration 
period, and a the width of the ray, are the dimensions of the ray. The 
standard deviation is, therefore, a- = o/t. 

The hypotheses to be tested are: 

= The power distribution (in the region) is due to noise alone. 
Hj = There is a signal in addition to the noise. 

The decision rule will be to reject if 7 > y + z^a- where z,^ 

the z-score associated with a false alarm rate a (i.e., Pr(z '> z^) = a). 
Therefore, the probability of detecting a signal when it is present is, in 
general , Pr(x > y + -). 

Now, under there is signal power in addition to noise so the 
— - kS 

expected value of x is x » y + ^ where k is the number of times the 
signal is in the ray during the integration periOu .. Since the signal 
power is assigned to be a constant, x is the same value as before 
(i.e. , o- = a//ta ). 

In the special case of the CWD a projection ray is approximately 

- kS 

1 Hz wide (a = 1) and t seconds long, so the value of x is y + -^ where 
is that part of the time a signal is picked up by a projection ray. 
y is the mean noise power, a— = and the signal to noise ratio is -■ 

(the average signal power divided by the average noise power). 

By definition a z-score is z = — where Y is the mean, y a 

specific value and Oy is tne standard deviation. In this case it is 
given by z = - - . Then: 1-3 = probability of detection = P^^^ = 
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(A3. 10-15) 


’’(let M ♦ i^oj) ■ Pr( ^ M * ^g°; - (h t ) 

■ PH z > z„ - ) 

■ 1 - PH z 

Table A3, 10. 3 gives the number of seconds needed In a time block to 
achieve a 99* probability of detection for various a, k/t, and S/p. 

Figures A3. 10. 13 (a), (b). and (c), and Tables A3. 10.4 (a), (b). and (c), show the 
probability of detection versus time for various k/t, P/u, and a levels 
using equation (A3. 10-15). 

Based on these statistics, the probability of detecting a signal 
after 20 seconds of integration when SNR » 1, the false alarm rate 
a = .01, and the angle of drift coincides exactly with the slope of some 
ray, is approximately 98*. If the drift angle is such that a projection 
ray picks up half the available signal, then the probability of detection 
for the same parameters specified above, degrades to 46*. There are 
several ways to increase the probability: one is to raise the false alarm 
rate, i.e. if a ® .05, then the probability of detection is increased by 
72%; another is to increase the integration time, i.e., if t = 40, then 
when a = .01, the probability of (tetection is about 85* and when a = .05 
the probability of detection increases to 94*. Longer integration 
periods are possible by combining the approximate GCV numbers (ray sums) 
every 20 seconds. 

It appears from the foregoing analysis that at least one projection 
ray must pick up half the signal, if present, in order to detect the 
signal. Therefore, the number of projection rays to include in the 
detection mask is a function of that constraint. 
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g »» .25 


S/N 

JjLL 

time(s8C.) 

.8 

1/2 

56 

.8 

1/3 

127 

.8 

1/4 

225 

a = .05 



.8 

3/4 

44 

.8 

1/2 

99 

.8 

1/3 

223 

.8 

1/4 

396 

.5 

3/4 

113 

.5 

1/2 

253 

.5 

1/3 

570 

.5 

1/4 

1014 

a = .01 



1 

1/2 

87 

.8 

3/4 

60 

.8 

1/2 

136 

.8 

1/3 

305 

.8 

1/4 

543 

.5 

3/4 

154 

.5 

1/2 

347 

.5 

1/3 

782 

.5 

1/4 

1390 

a = .001 



1 

1/2 

118 

.8 

3/4 

82 

.8 

1/2 

184 

.8 

1/3 

410 

.8 

1/4 

734 

.5 

3/4 

209 

.5 

1/2 

170 

.5 

1/3 

1058 

.5 

1/4 

1880 

g = 10 "’ 



.5 

1 

54 

1 

1/2 

215 

1 

1/2 

335 

.8 

1/2 

860 

Table A3. 10.3 - Approximate Nunfcer of Seconds 

Needed for the Integration 

Period to obtain 

a 99% Probability of Detection at Various 

g and S/N. 
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S/N * 1 a * 0.001 k - t 
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Probability of Detection 





Probability of Detection 


[ 



Integration Time in Seconds 
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1% 80% 98% 99.9% 

.1 53 92 99 99.9 


Va ~ ..♦Q 











57 89 

98 

99.8 





.1 

29 69 

90 

98 





,01 

12 45 

75 

91 

97.5 




SAel 

= .5 








23 46 

66 

80 

89 


98 



6 20 

36 

53 


78 


97 


1.7 7 

17 

29.5 


57 


90 

S/fl. 

= .3 








8.4 16 

25 

33 






1.6 4 

7.4 

11.7 







1 

2 






Table A3. 10. 4(a) - Approximate Probability of Detection for Various 
a and Integration Periods t, when k/t « 1 = 
Perfect Match 





.05 

76 

96 

99 

100 



.01 

52 

85 

96 

100 



.001 

24 

60 

85 

100 



0001 

10 

36 

66 

100 




S/a = .8 






.05 

60 

85 

95 

100 



.01 

33 

64 

83 

93 



.001 

12 

32 

58 

76 

94 

99 

0001 

4 

15 

34 

54 




S/a = .5 






.05 

32 

51 

66 

76 

89 

98 

.01 

13 

26 

39 

52 



001 

3 

8 

13 

24 

43 

75 

001 

.6 

2 

5 

9 




Table A3, 10. 14(b) - Approximate Prcrfbabili ty of Detection for Various 

a and Integration Periods t, when k/t = 3/4 
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0 10 20 30 40 so 60 70 80 90 100 



.05 

46 

72 

86 

93 97 



.01 

22 

46 

66 

80 



.001 

7 

20 

36 

53 

78 

97 

0001 

2 

7 

17 

30 

57 

90 


II 

O 

(/> 






.05 

35 

56 

71 

81 

93 

100 

.01 

14 

30 

44 

58 



.001 

3 

10 

18 

29 

50 

82 

.0001 

.7 

3 

7 





S/o = 






.05 

20 

30 

39 

47 

61 

80 

.01 

6 

11 

17 

23 



.001 

1 

2 

4 

7 

12 

28 

.0001 

.2 

.5 

1 

2 




Table A3. 10. 14(c) - Approximate Probability of Detection for Various 

a and Integration Periods t, when k/t » 1/2 
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Consider the situation depicted in Figure A3. 10, 14 where the signal 
lasts t seconds and falls within the ray labeled S which is a perfect 
match. Suppose the signal is traversed by a projection ray then the 
amount of the signal picked up by the ray is proportional to the shaded 
area A and A = 1/2 {(cos(20^) + l)cotAG - sin(20j)} where 0^ is the angle 
the signal makes relative to the vertical and A0 is the difference 
between the ray angle 0|^ and the signal angle. 

For the implementation of the algorithm the area of every ray is 
the same, no matter what the angle of the projection Is. Also, that 
area of a ray of width 1 unit and length t seconds is t square units. 
Therefore, in order to ensure that half the signal falls within the 
a projection ray we need to have 

t = (cos(20g) + l)cotA0 - sin(20j) 
so 

cotAO = t ^ sln(;e ^ ) 
cos (20^) + 1 

Now, as 0^ ^ 0, cotAG -* t/2, and as 0^ 45° , cotA0 -► ^ | ^ . 

If 1 = 20 seconds, AG is between 3° and 6° whenever 0^ is between 0 and 45°. 

Consequently, if the ray projections of the CWD mask are separated by 
3° it should be possible to detect a signal with an acceptable probability. 
Furthermore, when t * 20, tan"^ ^ which means that the ends of the 
rays overlap. 

A second consideration involved in the choice of the number of angles 
is the maximum amount of Doppler shift caused by planetary rotation for a 
planet that could support life. For a planet like the earth, the Doppler 
shift is .15Hz/sec. (CYCLOPS, p.57); for Jupiter it is 14 Hz/sec, and for 
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Figure A3. 10. 14- Proportion of Area in the Intersection of an 
Arbitrary Ray R with an Ideal Ray S, which 
Contains the Signal 
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an Earth-sized planet rotating at Jupiter's rate (8 hr) It Is 1.4 Hz/sec. 
(Machol. 19 ). 

Finally, there are Implementation and cost considerations. A 
reasonable choice, for a first approximation, seems to be 20 angles 
spaced 3° apart covering the range from -27° to +27°. 

Combination of Successive Generalized Coherence Values 

The sets of generalized coherence values calculated every 20 seconds 
are used not only for Immediate threshold testing, but are accumulated 
as time goes on to obtain additional results encompassing all the data 
received up to that time. 

Both the old and new sets of accimiulators each contain GCV's for 
each of eight million frequency channels for forty-one uniformly spaced 
drift rates between plus and minus one hertz per second. However, it 
is not optimal to simply combine corresponding frequency-drift-rate 
cells. 

When two sets of accumulators are combined, those individual GCV's 
are added together which could approximate the same uniformly drifting 
signal. For example a typical low level signal might be present most 
strongly In accumulator A from one set and accumulator B in the next set. 
Most likely A and B would have the same drift rate and the reference 
frequency of B would be offset from A by the drift rate times 20 sec. Note 
that the correspondence between one set of accumulators and the next 
depends on their individual drift rates. 

If a uniformly drifting signal had a drift rate that exactly matched 
one of the accumulators, the best way to combine the GCV's from the 
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previous time periods with the GCV's for the present 20 sec would oe 
to add the values of corresponding accumulators together. This would 
always add accumulators containing the signal together and would always 
add accumulators containing only noise to other accunnjlators that 
contain only noise. This would allow signal -containing accimulators to 
be most clearly distinguished from those that do not. 

However, a real signal could have any drift rate and would In 
general be expected to gain or lose frequency relative to the closest 
accunulator drift rate. In fact, from one set of accumulators to the 
next, a signal could go to the next higher or lower drift rate as well 
as the next higher or lower frequency. 

An initial step in resolving this airt>iguity is to offset the 
accumulator frequencies by one-half hertz in alternate 20 second sets. 

This reduces to four the number of accumulators in one set that could 
contain the same signal as a particular accumulator in the next 20 
second set. 

For a signal that is not clearly distinguished from noise one 
cannot be sure which of the four accumulators from the previous set might 
have contained the signal. The best estimate is to choose the one with 
the highest 6CV. Conceptually, the procedure is to add the maximum of 
four possibly corresponding accumulators from one set to each of the 
accumulators in the next set. This modified set of acc.Hnulators is then 
used in the same way to modify the following set and so on for as many 
sets as desired. 

Every accumulation produced from a combination of 20 second periods 
is tested against an appropriate threshold. Whenever a threshold is 
exceeded it is considered sufficient evidence that a signal is present and 
a flag is set for the control computer. 




1 




J 
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Hcm often have I said to you that when you 
have eliminated the impossible, whatever 
remains, however improb^le, must be the 
truth. 

--AC Doyle, Sign of F(Hir 


Appendix A3. 11 

Cluster [te tec tor Details and Statistics 


Step 1: The Histogram Analysis 

We initially subdivide the 50 x 250,000 compacted array into 1250 
pattern blocks and have dedicated processors opero< : jn each pattern block 
independently. Subdividing the compacted array vastly reduces the 
number of computations and, hence, hardware requirements, and does not 
result in a significant loss of information for the second step analysis. 

After subdivision, a processor establishes a histogram for its pattern block by 
simply "counting" the number of pattern points in each pattern class. If 
the nimiber of pattern points in some pattern class exceeds a preset thresh- 
old, the central computer is alerted. An excess number of points in a 
pattern class within a pattern block suggests the presence of a signal due 
to an unusual number of one or more NBIT tests scoring a pass ( = "1"). 

This possibility may be confirmed by the clustering algorithm of step 2, 
which detects the spatial structure of these pattern points within the 
pattern block. The rationale for counting the pattern points in the 
pattern classes instead cf counting identical types of pattern points 
within a pattern block is provided by the following example. 
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Suppose that in some pattern block there is a narroM band signal which 
consistently sets the same three bits for a 1000 seccNids. By our supposition, 
the signal will be present in 50 cells of the pattern block. A pattern point 
affected by the signal has six bits which remain unaffected. Each of these 
six bits has a 50% false alarm rate. Hence a pattern point affected by the 
signal has equal probability of belonging. Although the signal to noise 
ratio remains essentially the same, we have a clustering of 1294 - 1250 = 44 
points in this class as compared to less than one for the other histogram. 
Furthermore, the three 2-bit and three 1-bit subclasses of the above 3-bit 
pattern class will also show significant clustering. This provides us 
with another indicator as to whether or not a pattern class is affected 
by a signal . 

As was indicated in Section IIID4.5, the first stage clustering serves 

two basic functions. One is to compactify the data even further, in the 
sense that the input to the second step analysis cwsists of only those 

pattern points whose classes are deemed "interesting" by the first step. 

The second function of the first step clustering is to alert the central 
computer if some pattern class contains an unusually large number of 
pattern points. In order to achieve this objective we establish thresh- 
olds for the histogram such that whenever the number of pattern points 
in a particular pattern class exceeds its th res! Did, the alarm is 
activated. The levels of the thresholds are such that the false alarm 
rate will be within acceptable limits. We expect some readjustment of 
these thresholds as more data concerning the i nterdepenctency of the 9-bit 
tests becomes available during actual simulations. 


- 362 - 



Let i = 1,2, 511 identify the pattern class (we do not consider 

the zero or "no bits are on" pattern class). Let n(i) = 1,2,., .,9 the 
number of bits which determine the i-th class, N.* the number of pattern 
points in the i-th class/pattem block, and the threshold for the 

i-th class. If we compare the desired false alarm rate with tossing a 
coin once every 1,000 seconds, where "heads" corresponds to "no false 
alarm" and "tails" to "one or more false alarms", and we want the coin 
to be honest, then we must set 

Pr(x > 0) = 1 - Pr(x = 0) = 1 - (1 - ^ q 5^ 

where a = Pr(N^. > for at least one i) 

= Pr(at least one threshold exceeded) 
and x ^ b(1250, -). 

Solving for a, we obtain 

1 - a = (0.5) = 0.999445636 or 
a * 0.000554364 


We note that 

Pr(x = 1) = I250a(l - a)‘^-’ = 0.347, 

Pr(x = 2) = ^ a"(l - a)'^**" = 0.1201 and 

Pr(x = 3) = 1^50 X 1249 x 124 ^ ^3{i . 01) 12-7 ^ q.0277. 

Thus the probability of two or three false alarms in 1,000 seconds is very 
small. If a. = Pr{n. > ), then Pr(N^. < T^(i)) = 1 - ot^ . 
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Since Pr(N^ < V i * 1,2, ...,511) = 1 - a, nie have 

511 

n (l-a. )*(l”^**’*“l“tt or 
1=1 ^ 

(1 - a) = (0.999445636)*^®“ = 0.9999989148. 


The corresponding normal deviate is z-^ z 5. Therefore, 


^"(i) ~ “n(0 = 5 
“nO) 


or 


^n(i) " ^°n(i) ^ ‘*n(i) * 


**n(i) expected niniber of pattern points in the i-th class 

under noise alone and denotes the corresponding standard deviation. 

We Ctefine the sensitivity , associated with the threshold 

as the minimum nunt>er of cells/pattern block a signal must occupy in 

order for u / . \ > T , . v , where IL/i\ denotes the expected value of the 
n(i) n(i) nn; 

signal in random noise. If the signal occupies k cells, then from the 
inequality 

_ 10,000 - k ^ ^ T 

u / . » = — - — -7 ■ T k > i_ 


we obtain 


Hence, 


■*n(i) ^rUTT 

2"^’Hn(i) - 10,000 

_ - 10 . 000 ' 
^n(i) 2 "^“ - 1 


'n(i) 


, where 


[x] (tenotes the first integer greater than x. 

For a = 0.000554364 and z- = 5, we obtain the threshold sensitivities 

shown in Table A3. 11.1. 
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n(1) 


Hi(i) 

’’tid) 

*n(() 

9 

4.42 

19.53 

42 

23 

8 

6.32 

39.06 

71 

33 

7 

8.8 

78.13 

122 

43 

6 

12.4 

156.25 

218 

62 

5 

17.4 

312.5 

400 

91 

4 

24.216 

625 

746 

130 

3 

33.07 

1250 

1415 

189 

2 

43.3 

2500 

2717 

290 

1 

50 


5250 

1 

500 


Table A3.11.i - Threshold Sensitivities 
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step 2; Single Linkage Cluster Analysis 


This algorithm searches for spatially contiguous cells of a pattern 
type within a pattern block. Two cells are contiguous with respect to 
the i-th pattern class if they are within a pre-specified distance d^. 

If contiguous cells form a group such that the nunber of cells within the 
group exceeds a preset threshold T^ depending on the i-th pattern class, 
then a cluster exists and the central computer is alerted. 

The input to the algorithm are the pattern points of pattern classes 
deemed interesting by the histogram, where "interesting" is defined as 
follows. Let j = 1,2,. ..,1250 identify the pattern block, N.. = the 

• J 

number of pattern points in the i-th class and j-th pattern block, and 

set z. . = — , The interesting classes and pattern blocks are 
'J ^n(i) 

those which have large normal deviations z^j. Because of the compactifi- 
cation scheme, we have the real-time ability for approximately 100 searches/ 
10 seconds. Hence we may search for clusters in pattern blocks producing 
the 100 largest normal deviations. 

The implementation of cluster seeking is based on the general algo- 
rithm described next. The algorithm will vary somewhat according as to 
whether we are considering one and two bit pattern classes, or three and 
greater bit classes. For each large z... the algorithm establishes three 

* J 

lists. A.., B.., and C. .. Initially A., is a list of cell positions of 
all cells in the j-th pattern block which are occupied by pattern points of 
the i-th class; B^.j is a list of distances between all pairs of occupied 
cells sorted by ascending distance with ties arranged arbitrarily, and 
C^j =0, The algorithm proceeds in three steps. 


- 366 - 


1 . 


If the first distance in is greater than d^, stop. 

There is no cluster. Otherwise, remove the first entry from 
, and remove the cells corresponding to this distance from 

* J 

and add them to C^. 

2. Update the list B. . by computing the distances from cells in 

* J 

A., to the "cell" C. .. 

* J * J 

3. If every distance to C^j is greater than d . , return C.^ to its 
original state and go to 1. Otherwise remove from B. . all 

* J 

distances between cells of A^j and the cluster C.j which are 
less than or equal to d^. Remove the cells corresponding to 
these distances from A.j and add them to C^j. If the manber 
of pattern points in exceeds the threshold , stop. A 
cluster is found. Otherwise go to 2. 


Note: For a c d(a,C^j) = min id(a,c), c e C.j} 

We may obviously continue the algorithm in order to establish the actual 
size of C. . and/or for finding further clusters in the remaining list A... 

I J • J 

The actual implementation of the algorithm for three and larger bit 
classes is reduced to a simple scan and record procedure. A processor 
scans A^j and if a cell of A^^ does not have a preset nutrt>er N^. of 
neighbors within the distance d. , the cell is removed from A... After 

I I J 

all cells which do not have neighbors within d^ have been removed, the 
clusters remaining are then examined to see whether the number of cells in 
any cluster exceeds T.. We may also repeat the entire procedure on the 
clusters (Stained by redefining N^. Figure A3. 11. 1(a) shows the 2-dimensional 
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cn;C'NAL 

BLACK AND V/HITC F'MOiC^nAPH 


array representing where the i^^ class is a 3-bit pattern class, 
before the application of the algorithm, while Figure A3. 11.1(b) shows the 




Figure A3. 11.1 - Effect of the Single Linkage 
Cluster Analysis Alaorithm 



UKiCmivMI. HAG£ is 

OF POOR QUALITY 

The algorithm is scsmewhat modified for the one and two bit pattern 
classes. Since each bit has a 50% false alarm rate, half of all cells in 
a pattern block are expected to be occupied by any given 1-bit pattern 
class due to noise alone. This situation is illustrated in Figure A3.11.2 
where the "on" cells in the pattern block belwig to a particular 1-bit 
pattern class. 



Figure A3. 11. 2 - ® Pattern Block for the One Bit 

Pattern Class 

It becomes deer from Figure A3.11.2 that there Is an extremely high 
degree of contiguous ness between the "on" cells. In fact, one can find 
connected sequences of "on" cells forming meandering lines which stretch 
across the entire pattern block. 

Because of this high degree of contiguous ness, we are forced to set 
d. = 1 for one and two bit classes and to alter the clustering algorithms 
in order to keep the false alarm rate within a reasonable minimum. 

The altered algorithm is capable of finding clusters of cells 
containing horizontal and/or vertical line segments formed by contiguous 
cells, and of finding clusters containing rectangles of contiguous cells. 
For lines, there are two minimal alarm thresholds, T^ and ^ 2 * ^ ^2’ 

while for clusters containing rectangles there is only one, which we shall 
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denote by T. We set T 2 such that if the length of a contiguous 
sequence of cells in a row or colunm exceeds Tg, the probability of an 
alarm due to noise alone is extremely low. If T 2 is exceeded, we say 
that a cluster has been found and alert the central computer. The 
algorithm itself is reduced to a scanning and recording procedure and 
is accomplished in the following steps. 

1. Scan the first row of If a contiguous sequence of 

"on” cells of length greater or equal to is found add 
these cells to while removing them from A^j. Also 
remove all cells from A., appearing before this contiguous 

* J 

sequence. If the number of cells in C. . exceeds T^.stop. 

A cluster has been found. If there is no sequence exceeding 
Tp remove all cells in this row from A^j. 

2. Scan the next row. If another sequence of contiguous cells 
is found which exceeds Tj and is situated "below" C^j, that 
is, the vertical projection of C. . into the row below is 
contained in the sequence or the vertical projection of 

the sequence into the row above it is contained in C.^, remove 
the cells from A^j and add them to Otherwise, return 

to its original state and go to 1. 

3. If the number of cells in C.^ exceeds T, stop. A cluster has 
been found. Otherwise go to step 2. 

If no clusters are found and A., is erased, we repeat the algorithm 

* w 

by refreshing A^^ with its original data, and by replacing "row" with 
"column". That is, we scan along columns. Obviously, the algorithm stops 
whenever all the data fron A^^ has been exhausted. The effect of the 
algorithm is shown in Figure A3. 11. 3. Here (a) represents a pattern block 
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(a) - Before 


(b) - After 


Figure A3. 11.3 - Effect of the Modi f led Si nqle Linkage 
Cluster Analysis Algorithm for a 
One-Bit Pattern Class 


Our final goal is to obtain a method for establishing some of the 
thresholds discussed above. Since we are interested in clusters formed 
by contiguous cells, the first question which arises is as to the proba- 
bility, due to noise alone, of obtaining at least r contiguous (d^ = 1) 
cells belonging to the same class in a row (column) of 200 (50) cells 
long. Calling a sequence of r contiguous cells in a row of n cells a 
run of r successes , we are asking for the probability of having a run of 
r successes in n trials, where n = 200 or n = 50. 

Let P. denote the probability of the event that a cell contains a 

pattern point of the i-th class due to noise, and let P denote the 

n ,r 

probability of a run of r successes in n trials. Then P is given by 






where 


n,r n,r 1 n-r,r 




r)k 


and m is the integer truncation of -- i v . Values of P„ „ for n * 200 
are given in Table A3. 11.1. We computed P^ ^ for as many as fifty conti- 
guous cells, an event which would certainly trigger alarms set in previous 
stages and in step 1 clustering. Values for and J 2 can be chosen from 
the r column of Table A3. 11.1 with the corresponding P^ ^ providing the 
false alarm rate. A rough estimate for T can also be computed with the 
aid of Table A3. 11.1. Thus, if we choose = 8 for a 1-bit pattern class 
and multiply P„ 3 ^ times, we obtain a rough estimate as to the probability 
due to noise, of obtaining a T = 8 x k rectangle of contiguous cells. 
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Table A3. 


11.1 - Probability that a contiguous string of r cells, contained 
In a linear string of n cells, will exceed the detection 
threshold due to noise alone. Three exanples are shown. 


P^ « 0.03125 


n 

r 

Pn r 
n,r 

200 

2 

1.7E-01 

200 

3 

5.8E-03 

200 

4 

1.8E-04 

200 

5 

5.6E-06 

200 

6 

1.7E-07 

200 

7 

5.4E-09 

200 

8 

1.7E-10 

200 

9 

l.OE-11 

200 

10 

8.8E-16 


P^ « 0.00781250 


n.r 

200 2 l.lE-02 
200 3 9.3E-05 
200 4 7.2E-07 
200 5 5.6E-09 
200 6 4.0E-11 
200 7 1.7E-15 
200 8 1.3E-17 
200 9 l.OE-19 
200 10 8.4E-22 


P. = 0.00390625 


200 

2 

3.0E-03 

200 

3 

l.lE-05 

200 

4 

4.5E-08 

200 

5 

1.8E-10 

200 

6 

3.5E-15 

200 

7 

1.3E-17 

200 

8 

5.4E-20 

200 

9 

2.1E-22 

200 

10 

8.2E-25 
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Appendix A3 . 12 

An Evaluation of Some Classical Algorithms in the SETI Context 
1. Matched Filters 

A possible SETI signal detection system might be constructed 
utilizing an array of matched filters. A matched filter is an algorithm 
for the coirt)ination of information from the MCSA in such a manner as to 
maximize, for a given expected set of signal characteristics, the peak 
FX)wer to root>mean-square (rms) noise power. 

It can be shown (Schwartz, Bennet, and Stein, 1966) that if a signal, 
represented as a function in time by f(t), passes through a linear filter, 
whose transfer function is H(w), that filter is said to be matched if: 

H(u) = F*(w) 

where F*(u) = complex conjugate of the Fourier transform of f(t), 
t(j = time of occurrence of the peak value of f(t) 

This filter performs the function of; 

a) weighing the spectral components in the filter in proportion 
to the spectral components in the signal; 

b) aligning all spectral components in the filter in phase. 

This is a type of conjugate matching. 

If such a filter can be implemented, the ratio of the peak signal power to 

rms noise power, assuming white noise, is 

S . 2E 
N ‘ KT 


where E = the total signal energy 
K = Boltzmann's constant 
T = system noise temperature 
Thus, the detectability of a signal 
of the form of the signal. 


through a matched filter is Independent 
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It Is possible to Ingress Information onto an ot^rwlse monochromatic 
continuous carrier In a number of Mays: ai^)11tude. phase, freqtMncy, and 
polarization. Terrestrial technolooles have used all four possible 
modulations to some extent In order to convey messages via the electro- 
magnetic radiation. One might expect that It could be a straightforward 
exercise to implement a large set of filters, a group for each modulation 
scheme, which would respond. In sone optimal way, to extraterrestrial 
modulated RF energy. In order to acctw^lish this, however, one must 
know a priori , the form of the message contained within the modulation. 

Since this is quite obviously impossible, this direction for a SETI 
detector Is not reasonable. 

Me expect that an ETI signal, (teliberately radiated for the purposes 
of detection (a beacon) by other civilizations, would be a "single" 
signal in order to increase the likelihood of detection (Dixon, 1973). 
Simplicity, in terms of detectability, implies extreme spectral purity 
(Seeger, 1971; Oliver and Billingham, 1971). Further, many forms of 
terrestrial modulation use the redundant information contained within 
the monochromatic carrier to facilitate coherent detection. Therefore, 
the strategy is to attempt to detect the narrowband carrier, and to 
leave the examination of possible modulation sidebands, however they may 
manifest themselves, to a purely statistical approach. The advantage of 
this latter point is that we search the fCSA output only for possible 
departures from noise statistics. 

Signals which are pulsed in a time scale shorter than the reciprocal 
bandwidth of the MCSA (1 Hz) present special problems to the signal detector. 
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The present design of the HCSA precludes optimum detection of pulses 

b<>tMeen time frames from 1 second to > 1/8000 second, and from ? l/^KK) 

.g 

to -10 second, the reciprocal IF bandMidth. Pulsed signals In those 
realms must be found by an analysis of the time domain signal , ratter 
than the frequency <toma1n signal as presently envisaged In the HCSA. 

2. Walsh Transforms 

As a cartoonist captures the 1n4>ortant features which make a 
simply-drawn caricature recognizable. It would be desirable to have a 
transform which would extract —from the NCSA output —recognizable 
features necessary to Indicate the presence of a signal. Walsh functions 
and transforms were explored as possible candidates for that transform. 

The Walsh functions may be teflned as an orthonormal set of square 
wave functions. 

The Walsh function has t«M} argwnents: n Is the order and o Is 
normalized time. The orter n Is related the the nunri>er of zero crossings 
in the interval over which the function Is defined. (See Fig. A3.12.1 taken 
from Harmuth, 1977). 


I C~^ I I I, 

■ _J- - I ! ' ' ' ‘ I' I— 

- FbF b - 

■ UU - 

nfL F- — 

- JlFtf lr ■' 

- rUm Fh - 

Figure A3. 12.1 - Walsh Functions, Continuous and Discrete 
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Each Walsh function may be expressed as a product of odd symmetry 
periodic square waves of air^lltude ^1. (ftademacher functions) 

Walsh functions may be transformed into the "sequency" domain, where 
sequency Is the nunber of zero crossings. A sequency spectrum plot would 
Indicate the nun^er and magnitude of the coefficients necessary for a 
-Malsh exoanslon. (See Figure A3.)?..’ taken from Alexandrldls. 1971) 

(Schreiber and Sandy, 1974). 

Among the applications of Walsh functions are: 

Pattern recognition systems 
Image compression 
Design of radar waveforms 
Voice processing 
Signal multiplexing 

Walsh analysis has several advantages over sinusoidal analysis. 

(1) Walsh functions contain real Integral mmibers of two levels 
(*1). This makes them directly compatible with digital 
integrated circuitry. 

(2) The Walsh function set Is a natural describer of pulse-like 
signals. This property may be used to extract features from 
binary patterns (Alexandridls, 1971). 

(3) The transform from Input data to Walsh function series 
expansion coefficiclents Is faster than the FFT. For N 
samples, the FFT takes 2Nlog2N multiplications and additions 
(Blachman, 1974). The FWT takes only N1og2N additions 
(Harmuth, 1977). 
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Furthermore, techniques are available for obtaining two dimensional 
WaUh transforms In real time without waiting for all samples to be 
scanned before starting the transform operation (Alexandridls, 1971). 

Three potential applications of Walsh functions are applicable In 
this context. 

First, It Is possible that the ETI signal may be a transmitted 
electromagnetic Walsh wave. The transmission of Walsh waves has been shown 
to have certain advantages In applications such as radar (Schrelber and 
Sandy, 1974). However, waves traveling through space would be filtered by 
the Interstellar medium. A sinusoidal signal may be attenuated or phase- 
shifted, but the sinusoidal character would be preserved. A filtered 
Walsh wave would assume a significantly different characteristic. 

Secondly, it is possible to analyze the MCSA output using a two- 
dirrensional transform to pick out periodicities in time. The two- 
ditrensional Walsh transform Is attractive since the Implementation of the 
fast Walsh transform is less expensive and faster than the FFT. However, 
for long samples of smooth signals, more terms are required In the Walsh 
series representation and greater accuracy Is required of their 
coefficients for a given RMS total error (Blachman, 1974). 

Finally, Walsh transform techniques are a possible approach to the 
recognition of binary patterns in the MBIT two-stage data compaction 
scheme. Axis symmetry properties of Walsh functions can be used In 
designing a feature extractor with a resulting reduction In dimensionality 
In the feature space (Alexandridls, 1971). However, due to the dis- 
advantage of the Walsh transform being position dependent, position 
normalization of the binary pattern is necessary. In addition, specific 
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binary patterns which indicate a high probability of "signal presence" iiHist 
be defined. 

Conclusion: 

Although the Walsh transform may lead to significant savings of hardware 
and processing time in analyzing binary or pulsed data, it seems not to be an 
all-purpose pattern recognizer. The Walsh transform was initially considered 
a candidate for serving as a "cartoon transform"; that is, a transform that 
would show strong narrowband repsonse in the presence of an edge, outlined 
form, or filled-in pattern. Thus, while the axis symmetry properties of Walsh 
functions may be used to recognize specific patterns, but apparently provide 
no clue to the presence of signals of unknown characteristics. 
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Appendix M.l 
Large Memory Systems 


A4.1.1 » Regulreiwents 

As Is the case with many other dimensions of this study* the memory 
requirements of tte proposed system are quite unusual. The two basic parameters 
of the memory requirements are the memory size and the memory readkwrite and 
access times that are required to handle the amount of data and the data rates 
that will be encountered. 

(kie large memory system Is required for the storage of the digitized 
Input signal. The analog signals being received from each antenna feed are 
sampled at a 16 x 10^ Hz rate for the 1000>second observation period for each 
star, and quantized Into 8-b1t samples. This information must be stored, at 
least tenporarlly. In case a suspected ETI signal Is discovered. The impli- 
cation of this requ1ren«nt is that storage (n«mory) Is required for 2.56 x 
10^' bits of digitized incoming data. These data arrive at a 2.56 x 10* bits/ 
second rate. 

A second large memory system Is necessarily involved with the output of 
the dual, 8 x 10* channel multi-channel spectrum analyzer (NCSA). If all the 
data being output by the dual HCSA for 1000 seconds is to be stored ten^orarlly 
for use by the ETI signal detection algorithms, the total required storage 
capacity is 0.512 x 10** bits. 

It rapidly becomes apparent that the hardware required to randomly access 
0.512 X 10** bits of memory In sufficient time to perform all the analyses used 
to detect ETI signals Is prohibitively costly In dollars, power, and space 
requirements even If read-write and access time limitations allow realtime 
processing. For this reason, the output of the dual HCSA Is truncated for some 
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of the algorithms to 4 bite each for tee real and Imaginary components of 
each feed; and further, we require ten^orary storage of "only" teenty seconds 
of MCSA output at a time. Thus, the data transfer rate Into this memory Is 
reduced to "only”; 

2 X (4 4) X 8 X 10* ■ 1.28 X 10* bite/second 

and a storage of "only": 

20 X 1.28 X 10* » 2.56 x lO’ bits 

Is required. 

A4.1.2 - Types of Memories 

The types of memories which are available to dram from for the various 
memory systems in our syst«n are, at least theoretically, quite varied. These 
Include: 

Semiconductor Random Access Memory (RAM) 

Magnetic Tape Systems 
Magnetic Disc Systems 
Read Only Memory (ROM) 

Charge Coupled Devices (CCD) 

Magnetic Bubble Memory (MBM) 

Factors which eliminate some of these memory types from further consideration 
include the bit storage capacity of currently available units as well as those 
projected to be available by about 1982, current and future read-write rates, 
access times, and, of course, the jjres^rit cost per bit of storage as well as 
projected future costs. 

Less Conventional Memories 

The last three types of memory listed above, the less conventional forms 
of memory, were eliminated from further consideration in this system for 
various reasons, including the desire to utilize a technology which has been 
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well -developed and proven to be a reliable means of data storage and retrieval. 

Charge coupled devices (CCD) appear to be In the research and development 
stage continuously, and seldom get Into production. 

Magnetic bubble memories (^^M) seen to be moving ahead very rapidly with 
the immediate objective being to replace disc memories In the hone computer 
market, (Mavlty, 1979) and (Waller, 1979), In the very near future. However, 
from a cost-effectiveness viewpoint, bubbles were ruled out as a viable alter- 
native at this lime. 

Read only memory (ROM) are employed In many of the ETI detection algorithms 
and/or as look-up tables, but they cannot play a role In the overall large 
memory system picture. 

Exotic Memories 

Additional exotic memory forms were investigated also. Optical recording 
has the capability of storing 10’ bits per square Inch. As attractive as this 
medium appears at first glance. It has the disadvantage of being a non-eras able 
medium, and thus It is usable only for archival storage In this project, 

(Hoag land, 1979), (Kenney, 1977), (Kac2orowsk1 , 1977), (Kenville, 1978), and 
(Tufte, 1973). 

Electron Beam Accessed Memory (EBAM) shows a good deal of potential, with 
densities of 10*^ bits per square inch possible, (Smith, 1978), (Spellotls, 
1976), and (Hughes, 1975). The most sertous fault with this technique Is that 
to date no operating EBAM system has been built. 

Holographic memories also present a possibility, but to date the memories 
are non-erasable and consequently are useful for archival storage only, (Glllls, 
1975), (Harris, 1979) and (Honeywell, 1979). 
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Random Access Memory 

Recent activity Involving solid state random access memory (RAM) has been 
primarily limited to decreased access time with an Improvement by a factor of 5 
having been achieved over the past two years, (Hewlett Packard, 1979). Another 
advance In RAM technology is the development of a very large semiconductor 
RAM-based memory by means of Adaptive Wafer Scale Integration (/WSI) as reported 
by Geldeman (1978), Hsia (1979) and Brinton (1979). A 1.2 x 10* bit version of 
the AUSI memory Is In development and Is scheduled for production within a year, 
with predicted cost to remain at about the present .03 cents/bit for semi- 
conductor memory, (McDonnell -Doug las, 1979). 

Disc Memory 

According to Hoagland (1979), the disc remains the primary form of mass 
storage. Despite competing technologies, improvewents in disc technology will 
probably ensure its widespread use throughout the remainder of the century. 

A disc system has been developed and is presently available which has 
multiple (20) read-write heads and recording surfaces which allow for 8 or 
9 bits of parallel Input or output data, (Ampex, 1979). This system can store 
2.4 X 10 bits and achieve 8.7 x 10 bits/second data transfer rates. 

As Hoagland (1979) also mentions, reduced cost per bit in all technologies 
derives primarily from an increase in density on the material being used for 
storage. If one were to double the bit packing density on a magnetic disc unit 
of the type described above, and then double the number of bits that are 
available in parallel for read-write operations, one would have the disc system 
presently being investigated in the Simulation Facility at Ampex, Redwood City, 
Calif. (1979). In any event, it is anticipated that the magnetic disc industry 
will be going to a double bit density magnetic mediimi. Also, the higher the 
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number of parallel outputs described abo\^ will be feasible In the near 
future. This system would have a capacity of 4.8 x 10* bits per disc pack and 
a potential data transfer rate of 3 x 10* bits/sec. 

Tape Memory 

In terms of providing for the storage of massive amounts of data the 
most widely used medium Is likely to continue to be magnetic tape. With 
Increased recording densities and Increased numbers of tracks per unit tape 
width, the amount of storage represented by a reel of magnetic tape Is 
exceedingly large. The packing densities that are presently being utilized 
are on the order of 10* bits/square Inch of tape (RCA, 1979), with one system 
under development which uses 180 tracks across a 2-1nch magnetic tape. The 
result Is a tape capacity on the order of 5 x 10*' bits, using a 2-1nch, 
10,000-foot tape (Bell and Howell, 1979). 

A new tape system under development by Ampex (1979), which utilizes a 
flying head technology, is scheduled to be available by 1982, This system has 
a capacity of 1.2 x 10** bits and is appropriately designated the Ampex Super 
HBR (high bit recorder). 

In tei^nR of rapid random access to stored data, the tape systems are use- 
less, with access times to the center of the tape running up to two and a half 
minutes. But, In terns of quantity of storage, the tape unit Is by far the 
least expensive storage medium. 

Table A4.1 is a -ompllation of the characteristics of memory units available 
now and those envisioned to be available from three to six years hence. 
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By 1982-1985 1979 


Character- 

Mag. 

Mag. 

Solid State RAM 

RAM-Based 

Istic 

Tape 

Disc 

Static 

^mini 

Memory System 

Max Capac. 
per Unit 
(bits) 

4 X 10^^ 

2.4 X 10® 

1.6 X 10^ 

6.4 X 10^ 


Read-Wri te 
Rate 

(Bits /sec) 

6 X 10® 

8.7 X 10^ 

1.6 X 10® 

1.6 X 10® 


Access 

Time 

(sec) 

* 

120 

it* 

2.8 X 10“^ 

lO-SO X 
10-’ 

100 X 10‘® 


Cost 

(t/bit) 

1.5 X 10“^ 

0.003 

0.1 

0.03 


Max Capac. 
per unit 
(bits) 

1.2 X 10^^ 

4.8 X 10® 

6.4 X 10^ 
(1985) 

1.28 X 10® 

( 1983) 

1.2 X 10® 

Re ad- Write 
Rate 

(bits/sec) 

7.5 X 10® 

1.7-3 X 10® 

1.6 X 10® 

1.6 X 10® 

1 X 10® 

Access 

Time 

(sec) 

* 

150 

it* 

2.8 X 10'^ 

10-50 X Q 
10"^ 

80 X 10"® 

1-2 X 10”^ 

Cost 

(C/bit) 

2 X 10"^ 

1.5 X 10"^ 

C.05 

0.015 

1 

0.03 


* to middle of tape 
** average 


Table A4.1 - Memory Characteristics 
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Appendix 4.2 Discussion of Present and Alternative Architectures 
In this appendix we will discuss briefly other signal detector architec- 

2 

tures that might be more appropriate to NCSAs which output data at a rate •v 10 
lower or 'V 10^ higher than the 8 Megawords per second rate of the Oasis System NCSA. 
A4.2.1. - Architectures for Smaller Scale NCSAs 

With a throughput rate dropping from millions to thousands of data per 
second, many powerful processing possibilities are opened up. Floating point 
nianbers rather than integers could be processed by programmable Floating Point 
Array Processors, now commercially available. The processors would be ideally 
suited to the high speed analysis of data because they possess the speed of 
dedicated hardware units and yet are programmable, and could be used to imple- 
ment many of the algoritlwts described in this report. Their advantages are: 

- Floating point preserves the dynamic range, eliminating clipping losses. 

- Hardware building cost is traded for lower software designing cost. 

- The software algorithms may be easily altered or replaced. 

The reduced throughput of a smaller NCSA might also allow: 

- Storage and random access of an entire observation on a few conventional 
disc drives (while 20 would be required for the 8 MHz NCSA). 

- The possibility of human preprocessing, since a larger percent of an 
observation could be viewed with a display. 

- The data of an observation could be stored in RAM metrories (say 64 Kbits 
each) and a network of minicanputers could simultaneously operate on 
each RAM employing either Identical or varied detection algorithms, i.e. 
network processing. 

A4.2.2 - Architecture for Larger Scale NCSAs 

For NCSAs on the order of 300 Mega-Channels per second, processing of the 
entire array with the algorithms of the Oasis system becomes nearly impossible. 

A different approach would be to sample and analyze the output of selected 
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''groups" of channels sequentially. In otl^r Mords« If ^ limit the total search 
to one for persistent carrier waves, pulses, etc., then we need only check a 
set of adjacent channels periodically for tiM presence of such a signal. The 
processing required can thus be arbitrarily reduced, but t1^ price for this 
reduction Is a loss of sensitivity to ephemeral signals. Thus, for the Oasis 
syst«n to analyze the data from a spectron«ter with 3CK) 10^ channels output per 
second at an 8 NHz rate. It Is first necessary to consider every 40th block In 
the first row of blocks, and then to consider each 40th block offset by one In 
the second row of blocks. Random selection of blocks may be ttK>re desirable 
so as to avoid systematic omission of Identically drifting spectral features. 

Finally, the signal detection algorithms for such a large bandwidth spec- 
trometer might be justifiably limited to simpler tests, such as employed by the 
pulse detection algorithm, and the total pwer detection algorithm of the NBIT 
processor. 

A4.2.3 - Critique of the Oasis Design: Weaknesses 

The proposed Oasis System Is a rather cwnplex system. The Summer Study 
Group as a whole consciously declined from rejecting any potentially powerful 
algorithm simply on the basis that It made for a more ccmiplex final system. 

Every attempt was made to evaluate the relative merits of each algorithm, yet 
time did not allow for a truly fair and ccwplete evaluation and ordering of the 
v/orth and merits (both as signal-specific, and signal -arbitrary tests) of the 
dozen or so different algorltNns Implemented. While much work was done toward 
this end, much more remains to be done. It Is reccmmended that further com- 
puter simulations of these algorltlwis be conducted, upon a more varied (In SNR 
as well as type) set of earth-civil Izatlon-type signals as well as artlflcally 
concocted signals In noise. If nothing else, such a study would yield the rela- 
tive weights to place on the alarm thresholds, as discussed In Chapter 5. To 
the hardware designer wishing to construct a sensitive, yet simpler and cheaper 



system, the Oasis System, as It stands, represents a menu or shopping list of 
algorithms. This hypothetical designer Is In turn given from this report the 
plans necessary for Implementing and Integrating any subset of the algorithms 
deemed most effective. 

The decision to retain 16 output levels (4 bits) of real and 16 levels of 
Imaginary signal amplitude, for each polarization In the Oasis System was made 
without a thorough evaluation of the resulting clipping losses Inflicted on each 
algorithm. The decision was based Instead on the high cost of providing another 
set of 4 high density disc drives for, say, 8 bits of amplitude resolution. The 
6CV and ANOVA algorithms were tested through simulations for clipping losses 
and were found to be only slightly degraded by 4-b1t clipping (see the appendices). 
Similarly, the proposed simulation of all Oasis algorithms should Include a 
check for the clipping sensitivity of each. 

A4.?.4 - Critique of the Oasis ttesign: Strengths 

Chapter 3 has described the first and second stage processing of the 
6 1 

entire 8 10 x 10 element array. It detailed how the array was divided Into 
blocks, each tested and compacted Into 9-b1t words forming the elements of pat- 
tern blocks for pattern recognition and cluster detection. The scale of the 
array subdivision test Is gradually Increased In each of these stages. This 
Is shown in Table A4.2.1. 


Table A.4.2.1 
Algorithm 


The Increasing Scope of the Major Search Algorithms 
Items Tested 


Number of Array Elements 
Considered at Once 


Pulse Detector (1st pass) 

Individual elements 

10^*^ 

Carrier Wave Detector 

60 elements /si ice 

NBIT Tests 

20x40 element block 

10^ 

Carrier Wave Detector 
(Accumulated) 

3000 element slice 

10^ 

Cluster [tetector 

200x50 elements, each a block 

10®*® 

(Size of Entire Array 
for comparison) 

8 10® X 1000 

10®*® 


* each slice Includes 3 elen^nts per row x 20 rows of the entire array 


This slowly Increasing scale or scope of the Oasis search, especially with 
such signal -arbitrary algorithms as N61T and Cluster detection Is considered 
a major strength of the system. This table also Implicitly gives a rationale 
for the sizes chosen for Block and Pattern Block arrays. 

A second strength of the system Is found In the transpose hardware system 
which. In real time, arranges the data Into a much more desired form within a 
memory unit. To search effectively for any signal, an individual channel's 
output over time must be stored in a memory device contiguously and analyzed 
in a like manner. The array transposer accomplishes this for the algorltNns, 
for the archives, as well as for the operator's perusal with an advanced image 
processor. 

Another strength of the system !s felt to be found in the Generalized Co- 
herence measure. This algoritiwn or measure seems to implicitly embody many 
fundamental SETI principles while providing a new means of compressing the out- 
put of an MCSA. Since it is a succinct measure of polarization, phase, and am- 
plitude coherence of received time series, it serves as a detector for those 
qualities expected to be characteristic of intelligent communication. 

Finally, the Oasis System places a good deal of emphasis on the capability 
of the operator to detect and identify signals. For this reason, the role of 
the operator has been given a wide latitude, including his/her facility of view- 
ing and interacting with all stages of the hardware processing. Indeed, to use 
the operator as the learning part of the system, capable of modifying the per- 
formance of the rest of the system and possessing complete control over the 
all-important archiving decision, we believe greatly enhances the performance 
of the total system. 


Appendix A4.3 

Implementation for (1) Degree of Polarization and 

(2) Broadband Pulse Detection (with GCV) 


Since these two algorithms require some common computation, they share 
a common front end processor which passes the values on to the two subroutines: 
m*routine, and GCV-routine. Conceptually, there are three processing 
conytonents: the front end processor receives the incoming data, computes and 
accumulates the power associated with each polarization, the cross product of 
the two polarizations, for corresponding data points, and the auto- and cross 
products for two adjacent frequency data points. After 40 frequencies have 
been processed, the first three sums are sent to the m-routine which calculates 
the degree of polarization. The other sums are passed along to the GCV 
routine. In addition, the front end processor computes three other factors used 
by the GCV routine; normalized power factors for each polarization and a phase 
coherence angle (6). 

The m-routine calculates the degree of polarization in each 40 Hz row of 
data according to the formula: 


m 


/Ulxil^ - * 4 lExiytP 

( E|x^l^ + E |y.|* ) 

The result is compared to a predetermined threshold and, if it exceeds the 

threshold, the routine sets on the 1-bit and sets an alarm for the central 

processor. 

The GCV routine calculates the general coherence value for each 40 Hz 
row of data according to the formula: 
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GCV • 


1 

"FT 


( Kj'zx^^jXf KjKgCe^^Ex^^jyf + e’^^lly^+iXf) + Kg^Sy^^jyf ) 


where 


1 ■ N V|x,|‘i!*'|y,T 


K, « 


1 T 


'4 v'|x,|^^j' |y,T 


and 


, Im Zx.yt 

6 = tan ^ — 

Re J^x^yt 


Part of the processor is operating on the ini'.omlng data while the rest of the 
calculation is being processed in parallel. The entire processor is a multi- 
stage pipeline. It is outlined in Figure A4.3.1 and presented in Figures 
A4.3.2 - A4.3.4. 
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Figure A4.3.1 

Schematic Block Diagram for: 

(1) Degree of Polarization by Row and 

(2) Degree of Generalized Coherence by Row 


Front End Processor 



In Parallel with Stiges 3 
Stages 3 - 8: m-RoitIne 


In Parallel with 
Stages 6-12: 6 


Compute the degree of polar- 
ization m and set l-b1t and 
CPU alarm If m > threshold 

/(£|x^|^-E|y^|M^ ^ 4|£x^y*| 


Compur-e the Generalized Coherence 
Value, GCV, and set l-b1t and CPU 
alarm If GCV > threshold 


KjK2(e^^£x^^jy* ♦ 
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Figure A4.3«2 - First Stage of 6CV by Rows 























figure M.3.4 - Parallel 6CV by Rom 
Second Stage also computing WIT Degree of Polariiation 





Appendix A4.4 
Survey of Image Processors 


This appendix sunnarlzes the results of a survey of conmerclally 
available digital Image processors. The evaluation of each system was 
based upon Its potential role as the operator-machine Interface In the 
Oasis signal detector. The survey Included careful examination of manu- 
facturers' literature, personal conversations, and demonstration of opera- 
tional Image processing systems. Two previous surveys exist. (LaPado, R. , 
Reader, C., Hubble, L. , ESL engineering report, ar^ Allen, R.J., and ter 
Horst, H.J., Excerpt from Market Study of Several Image Processing and 
Display Systems, University of Groningen). 

The current trend In state-of-the-art Image processor architecture Is a 
bus-oriented, pipeline array processor utilizing function tables and memory 
multiplexing techniques to maintain video speed (10 MHz) while performing 
various operations on Image data. 

The most powerful Image processors are capable of stand-alone operation 
with a keyboard and mass storage devices controlled by an on-board micro- 
processor. More commonly, the Image processor will be Interfaced to a host 
mini-computer for additional computational and I/O capabilities. 

Refresh Memory 

The Image data resides In refresh memory that Is typically dual-ported 
RM1 arranged In arrays of 8-b1t pixels for 256 Intensity levels per pixel. A 
nx}nochrome CRT requires one refresh array, and a color CRT Is refreshed from 
3 arrays, one each for red, green, and blue. The size of the Image data 
base Is limited only by the address snace. This varies amnno manufacturers 


- 398 - 


and Is usually mmra than can bn disolavnd at mica bacausa of an^llfler 
bandwidth limitations In tha CRT. Scrolling ovarcomas this limitation by 
allovrtng tha usar to romn at will through the entire rafr^h mmnory. 

In addition to Image refresh nwmory there Is usually one or more 
channels of graphic overlay memory. Histograms, vectors arvl alphanumerics 
are stored In this memory and combined at the processor output by pixel 
replacmiwnt or color combination so they are visible against the Image back- 
ground. 

PIpellrie Processors 

Refresh memory contents are read out and processed before conversion to 
analog video signals. The use of function table look-ups and high speed 
arithmetic hardware Is necessary to maintain video refresh rates. Pipelined 
processing allows an operator to scroll, zoom on a specified area without 
altering refresh memory contents, <^ta1n pixel statistics, and perform Image 
combination arithmetic In one or a few refresh times. A feedback loop allows 
processed output to be written back Into refresh meimiry so that more complex 
Iterative functions such as spatial convolution and filtering can be accom- 
plished In a few seconds. 

Displays 

The display device for high resolution monochrome or color monitors uses 
refresh rates of 30 Hz Interlaced or 60 Hz non-interlaced. Some Image pro- 
cessors accept external sync for use with video discs or digitized TV camera 
Input. The highest spatial resolution In both nmnochrome and color monitors 
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ojrrently available Is 1024 x 1024 pixels. Some of the most recently Intro* 
duced Image processors will handle the higher data rates for these monitors 
as well as stamiard 512 x 512 pixel monitors. Three Image processors-*the 
C(xnta1 Vision one/20, the Oe Anza IP 5000 and the I^S Model 70— meet the 
requirement of the Oasis syston. A comparison of the more Important features 
of these Image processors Is summarized In Table 4.4.1, and In the following 
paragraphs . 

jntal Vision one/20: 

- Refresh memory can be expanded to 134 Mbits. Menory configuration Is 
dynamic and under firmware control, 

- Can accomodate up to 4 Independent user stations. 

• Multiple Images In refresh memory can be displayed In rapid succession 
resulting In a "loop movie" effect. 
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De Anza IP 5000: 

- Good processing and feedback loop. 

- 3 X 3 convolution In less than one second. 


- Feediack Is routable to any bits of any refresh memory by defining a 
32-b1t mask. 


I^S Model 70: 

- Extensive software Is available, such as utilities for file management and 
Image rocessor ccMitrol and a library of Image enhancement and transform* 
atlon routines. 

- A feedback ALU does 16*b1t arithmetic on Images. 

- Mlnlnxxn and maximum pixel values and other pixel statistics are continuously 
avallabe. 
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I S Model 70 

extensive image processing 
and utility library. DEC 
compatible* mostly Fortran. 

non-destructive pixel repli- 
cation. x2,4*8 magnification 

up-down, left-right In 2-pixel 
i ncrements 

rectangular portions of 2 or 4-pixel 
increments 

I/O up to 2M pixels/sec 
1 to 12 512 X 512 &>bit images 
addressable by row or column. 
Readback to host. 

8 512 X 512 1-bit overlays. 
Scrollable. 

trackball, tablet, function 
buttons, joystick 
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Appendix AS.l 

Experiments on Observer Detection and Recognition 
of Simulated Signals In Noise 

Perhaps the most critical Information we desire concerning the human 
observers Is an understanding of his/her i^lllty to detect weak signals amidst 
background noise. There Is an exter^lve literature In psychology on Uris 
problem, but most of this work deals with the case where the signal character- 
istics are known by both the s inject and the ex|!^r1n%nt (see ^ts. Green, 
Getty, and Swets, 1978 for examples In the visual modality). The challenging 
»pect of the SETI problem Is that nobody has a firm idea concerning the 
precise nature of the signal. Moreover, It appears that little research has 
been done using visual noise fields similar to the ones anticipated In the 
present context (except Swets et al., 1978). For these reasons, then, we 
conducted some preliminary experiments with simulated visual displays. 
Observers made estimates of their confidence that a signal was present under 
a variety of conditions, spanning several signal types at various signal to 
noise ratios, as well as complete noise fields. Further, we present data to 
a new technique whereby observers draw potential signals on a matrix repre- 
senting the visual display. Analysis of such drawings may permit us to Infer 
the accuracy of localizing detections, and the types of "pseudo” signals 
observers perceive In noise fields. 

Observers. The participants were 14 members of the summer study program 
and 4 other professionals at the NASA- Ames research laboratories. They each 
participated on a voluntary basis, and were tested Individually. 

Equipment . The displey slimilatlons were generated by a Hewlett-Packard 
9800 computing system consisting of two primary components: the 9825A 
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calculator and a black and white video display unit. The video display 
was built especially by Hewlett Packarti for the SETI program. It presents 
a 128 X 128 pixel noise field with 256 (8 bit) levels of Intensity* The 
entire screen must be erased before the next picture is written; refresh 
tin« for the screen is 30 frames per second. 

Displays . The noise was generated over H levels of amplitude 
approximating a Reyleigh distribution. These amplitude values then were 
squared to approximate an exponential distribution of pixel intensities 
on the screen. This noise field Is what the observer saw in all d1spla»ys 
(different noise fields were used for each presentation). 

Signals were superimposed on the noise fields giving the impression 
of a pattern of brighter pixels. Average signal to noise ratios (SNR) 
were .4, .6, .8, 1, 1.2, and 1.4. 

Experimental design . Three patterns were employed at each SNR: straight 
line, wavy line, and pulse. The shape and calculation of these patterns 
are presented in Figure A5.1.1 (panels a, b, c). Examples of each pattern 
at the highest SNR, as seen on the screen, are shown in Figure /tt.1.2. 

The starting point of the signal was randomly selected from positions 
across the top edge of the screen and extended to the bottom edge. In 
addition to the signal displays (6 presentations for each of 3 signal 
conditions, for a total of 18) six noise field displays (i.e., without 
signals) were used. Hence, there were 24 displays presented to each observer. 
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Procedure. The 24 displays first were presented In a random order 
for approximately 10 second each. This ii^ase was Introduced to give the 
observer an Idea about the range and typo of displays. Next» the series 
was repeated and the observer gave a confidence rating from 1 to 10 con- 
cerning the presence of a signal anywhere In the display. Part of the 
Instructions for this phase read: "We want you to estimate your confi- 
dence that a signal (of any shape, size, or type) Is present. Give a 
number from 1 to 10 where 1 represents minimum confidence and 10 represents 
maximum confidence." The procedure was self-paced; the display was changed 
automatically after the observer's estimate was recorded by the experimenter. 

In the third phase of the experiment, the series was repeated and 
confidence ratings (Stained. In addition, observers drew potential signals 
on a sheet of graph paper. This page was divided Into 64 equal "boxes" 
or zones (l.e., 8 x 8 matrix). The Instructions for this phase read (In 
part; 

"In this last phase first give a confidence rating for each display. 

Don't try to recall what number you gave last time. Make each 
judgnent Independent of earlier ones. After giving a confidence 
rating for a display, draw potential signals on this graph paper. 
Precision Is not Important, just try to localize the perceived 
signal as best you can. For each display you must draw at least 
cxie signal regardless of your conflctence rating. In addition, you 
may draw other signals for the same display. These can be short 
lines, pulses, or other shapes, located anywhere in the display." 

The experiment was conducted under low room illumination which 

produced satisfactory visual contrast levels for the displays. The 

entire set of observations for a single session took approximately 40 

minutes. 
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Results and discussion 


Preliminary analysis suggested that there was minimum difference 
between confidence ratings on the two trials (phases), so these data 
were combined In further analyses. The means and standard deviations of 
the confidence ratings were calculated for each of the 24 displays* The 
results are presented in Table A5.1.1 (as a function of SNR and pattern type: 
line, wavy, pulse). The mean data are presented graphically in Figure AS. 1.2. 

Turning first to the mean ratings, there are obvious differences in 
an observer's confidence of his ability to detect the three types of 
patterns. The average rating for the noise fields Is approximately 2.5 
on the 10 point scale. Using this value as a lower referent, linear 
patterns are first detected at a SNR of .8; pulses and wavy lines at a 
higher value (1.0). Hence, by this measure, an observer's absolute 
detection threshold is lowest for straight lines. Further, once a 
straight line is detected, its presence is obvious, as indicated by the 
sharp increase in confidence at SNR ® .8. On the other hand, the confi- 
dence ratings for the wavy lines and pulses increase gradually, and level 
off at a low value than for straight lines. In short, straight lines are 
the easiest to detect; pulses and wavy lines are about equal in detectability 
across the values of SNR tested. 

The standard deviations also depend on the signal pattern. In general, 
the standard deviations (see Table AS. 1.1) are larger for the pulses and wayy 

lines than for the straight lines. The standard deviations also vary with 
SNR; greater variability was obtained for the higher SNR values with the 
exception of the straight lines, where variability drops dramatically at 
large SWl values. 
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The finding of an Increase In varlabllUy with higher wean ratings at 
first glanw appears paradoxical. Hox^ver, examination of the data of 
Individual observers explains this contradiction. What appears to have 
happened here was that observers split Into tuo distinct groups; one 
group (^finitely saw the signal* the other appeared to miss It completely. 

Hence, the combined data are highly variable. 

The signal drawing data were analyzed by calculating the percentage 
of hits and false alarms for each stimulus pattern. A hit 1$ defined as 
any mark drawn In a cell of the 8x8 matrix that In fact contained part 
of a signal. The percentage of hits Is the total number of correctly 
marked cells for all 18 observers divicted by the total signal cells 
(multiplied by the number of observers). Since this measure by Itself may 
or may not reflect accuracy (for example, a perfect score would result If 
observers marked every cell of the matrix). It Is necessary to compute 
the false alarms as well. The percentage of false alarms is defined as 
the total number of marked cells, where a signal was not present, divided 
by the total number of non-signal cells in the matrix (iiwl tipi led appro- 
priately to obtain a single group score). 

The percentage of hits Is shown In Figure A5.1.3 as a function of signal 
to noise ratio for each of the three signal types. The trend of the 
results is very similar to the confidence ratings presented in Figure AS. 1.2. 

The h1*s for the linear signal show a dramatic increase at SNR = .8; whereas 
the trend Is more gradual (and In fact, non-mono tonic) for wavy lines and 
pulses. 

The false alarm results are shown In Figure AS. 1.3. For each signal pattern 
the false alarm rate Is at or below the rate for noise alone (- 15-20%). 
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The linear signal yields the lowest false alarm rate. Quite obviously, 
the hit results are not artifactually due to a high percentage of the matrix 
being marked Independent of signal presence. 

Histograms were constructed to represent the location of drawn signals 
aggregated for all observers. Figure A5.1.7 shows the frequency of drawn marks 
for the linear pattern at the highest SNR (1.4). It Is clear t^at observers 
show substantial agreement on the location of the signal. (For purposes of 
representing the data, the size of the matrix on the histogram Is larger than 
the matrix used In the experiment.) 

Figures A5.1.8 and 9 present results for two noise fields. The Inter- 
esting aspect of these histograms is the apparent differences. In Fig. A5.1.P. 
the agreement Is minimal among observers concerning the location of potential 
signals. However, a fair amount of agreement is evident In Figure AS. 1.9. 

The latter finding Is Intriguing since It suggests that characteristics of a 
noise display lead people to Infer the presence of signals, even though their 
confidence In signal presence is rather low (the mean confidence ratings for 
the two patterns are approximately the same). 

The findings here must be considered tentative. Further analysis of 
individual differences among observers, with replications using larger 
sample sizes, should help to clarify and extend the present study along 
avenues that invite direct appl.cati c to the problem of signal detection 
and recognition. 
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Table A5.1.1 - Composite Statistics for Confidence Ratings 
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Figure AS. 1.1 (b) 
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Appendix A7.1 


Experimental Procedure Outlines for Several Feasibility Studies 
of Non-Astrophysical Uses of the Multichannel Spectnxn Analyzer 
and Signal Detection System 

Two experimental outlines for preliminary feasibility studies of non-astro- 
physical uses of the multichannel spectrun analyzer and signal detection system 
are presented In this appendix. Each of these studies can be Implemented with 
off-the-shelf hardware, however the studies will provide Insight Into the 
nature and depth of the results that could be obtained by Implementing the 
full system as proposed In this study. 

A Source >techan1sm Study 

In this study, the apparent downward frequency shift which accompanies 
growth of the acoustic emission source will be Investigated. 

Parallel pi peds or right circular cylinders will be cut from a single 
crystal of quartz (SIO 2 ). A stress riser hole will be drilled perpendicular 
to the long axis of the sample, which will allow a crack to stably propagate 
from the stress riser hole when the sample Is loaded. Zirconium titanate 
piezoelectric transducers will be bonded to the sides of the sample with 
conductive epoxy. The transducer outputs will be amplified by an 80-110 dB 
preamplifier and recorded on an analog Instrinient tape recorder. A video 
camera will be used to monitor crack velocity. The time gate signal on the 
tape recorder will be used as an Input to the camera In order to Insure 
synchronization of the camera and recorder. 

Each sample will be loaded perpendicular to the stress rider hole and 
the long axis of the srnnple until the crack propagation stress Is reached. 


- 422 - 
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when crack propagation Is Initiated, the sanple will be statistically 
loaded, All acoustic emissions during crack propagation will be reccrded 
on magnetic tape. The power spectrum as a function of crack length will 
be analyzed with a spectnmi analyzer. An off-the-shelf analyzer can be 
ified” for the Initial study because while tte frequency resolution over the 
entire spectrum of emisslorm Is poor, high frequency resolution can be 
obtained If the bandwidth of the analyzer Is re<kjced. The spectrum analyzer 
can be used In narrowband mode to examine the high and low ends of the 
spectrum by replaying the tape Into the spectrum analyzer. In order to deter- 
mine the relationship between crack length and frequent^ shift or crack 
vel(K;1ty and frequency shift. The sample and experimental configuration are 
shown 1n Figure 7.1.1. 

Stress Field Measurement 

Stress field measurement using the Kaiser effect as a stress measurement 
tool will be Investigated. The Kaiser effect Is basically an acoustic memory; 
If a rock has been loaded to 70% of Its compressive strength (with acoustic 
emission activity from roughly 40 - 50% of the compressive strength up to 
failure) acoustic emission activity will not occur, upon reloading, until 
the previous load Is exceeded. 

Parallel pi peds or right circular cylinders will be cut from a relatively 
homogeneous rock such as sandstone. Each sample will be loaded axially to 
70% of the compressive strength of the material and then unloaded. A piezo- 
electric transducer will be bonded to the side of the sample with conductive 
epoxy. Each sample will then be reloaded along the same axis up to 85% of the 
compressive strength of the rock, and the acoustic emissions during reloading 
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will be monitored. A series of sample sets with varying durations between 
Initial loading and reloading can be run In order to <teterm1ne the nature 
and longevity of the Kaiser effect. Samples can also be axially deformed 
under confining pressure In order to determine If the three dimensional 
stress field can be evaluated with the Kaiser effect. The output of the 
transducer will be amplified by an 80-120 <fi pre-amplifier and recorded on 
an analog Instrunent tape recorder. The signal gating track on the tape 
recorder will be used to enter the stress level data and time signal. The 
tape win be replayed through a spectrum analyzer. An off-the-shelf 
spectrum analyzer will be used. The gross characteristics of the spectrum can 
be determined with the spectrum analyzer In wideband mode and then the spectrum 
can be re-examined section by section with the spectrum analyzer In the narrow- 
band mode. The multichannel spectrum analyzer and signal detector proposed 
in this study has far greater sensitivity and frequency resolution than an 
off-the-shelf spectrun analyzer, however the feasibility of using the system 
to determine stress levels In fault zones can be Investigated with off-the- 
shelf equipment. The sample and experimented configuration are shown In 
Figure 7.1.2. 
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